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FIG. 1 

ID-65 
Clone 3-60 

GTGTTTATGATGAAAAAAGGACAAGTAAATGATACTAAGCAA 

TCTTACTCTCTACGTAAATATAAATTTGGTTTAGCATCAGTAA 

TTTTAGGGTCATTCATAATGGTCACAAGTCCTGTTTTTGCGGA 

TCAAACTACATCGGTTCAAGTTAATAATCAGACAGGCACTAG 

TGTGGATGCTAATAATTCTTCCAATGAGACAAGTGCGTCAAGT 

GTGATTACTTCCAATAATGATAGTGTTCAAGCGTCTGATAAAG 

TTGTAAATAGTCAAAATACGGCAACAAAGGACATTACTACTC 

CTTTAGTAG AG AC AAAG C C AATG GTGG AAAAAAC ATTA C CTG 

AACAAGGGAATTATGTTTATAGCAAAGAAACCGAGGTGAAAA 

ATACACCTTCAAAATCAGCCCCAGTAGCTTTCTATGCAAAGAA 

AGGTGATAAAGTTTTCTATGACCAAGTATTTAATAAAGATAAT 

GTG AAATG G ATTTC ATAT AAGTCTTTTG GTG G C GT ACGTC GAT 

ACGCAGCTATTGAGTCACTAGATCCATCAGGAGGTTCAGAGA 

CTAAAGCACCTACTCCTGTAACAAATTCAGGA AGCAA TAATC 

AAGAGAAAATAGCAACGCAAGGAAATTATACATTTTCACATA 

AAGTAGAAGTAAAAAATGAAGCTAAG GTAGCG AGTCCAACTC 

AATTTACATTGGACAAAGGAGACAGAATTTTTTACGACCAAA 

TACTAACTATTGAAGGAAATCAGTGGTTATCTTATAAATCATT 

C AATG GTGTTCGTCGTTTTGTTTTGCTAG GTAAAG C ATCTTC A 

GTAGAAAAAACTGAAGATAAAGAAAAAGTGTCTCCTCAACCA 

CAAGCCCGTATTACTAAAACTGGTAGACTGACTATTTCTAACG 

AAA C AACT AC AG GTTTTG AT ATTTT AATTAC G AAT ATT AAAG A 

TGATAACGGTATCGCTGCTGTTAAGGTACCGGTTTGGACTGAA 

CAAGGAGGGCAAGATGATATTAAATGGTATACAGCTGTAACT 

ACTGGGGATGGCAACTACAAAGTAGCTGTATCATTTGCTGAC 

CATAAGAATGAGAAGGGTCTTTATAATATTCATTTATACTACC 

AAGAAGCTAGTGGGACACTTGTAGGTGTAACAGGAACTAAAG 

TGACAGTAGCTGGAACTAATTCTTCTCAAGAACCTATTGAAAA 

TGGTTTACCAAAGACTGGTGTTTATAATATTATCGGAAGTACT 

GAAGTAAAAAATGAAGCTAAAATATCAAGTCAGACCCAATTT 

ACTTTAGAAAAAGGTGACAAAATAAATTATGATCAAGTATTG 

ACAGCAGATGGTTACCAGTGGATTTCTTACAAATCTTATAGTG 

GTGTTCGTCGCTATATTCCTGTGAAAAAGCTAACTACAAGTAG 

TGAAAAAGCGAAAGATGAGGCGACTAAACCGACTAGTTATCC 

CAACTTACCTAAAACAGGTACCTATACATTTACTAAAACTGTA 

GATGTGAAAAGTCAACCTAAAGTATCAAGTCCAGTGGAATTT 

AATTTTCAAAAGGGTGAAAAAATACATTATGATCAAGTGTTA 

GTAGTAGATGGTCATCAGTGGATTTCATACAAGAGTTATTCCG 

GTATTCGTCGCTATATTGAAATTTAA 
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MFMMKKGQVNDTKQSYSLRKYKFGLASVILGSFIMVTSPVFADQTTSVQVNN 

QTGTSVDANNSSNETSASSVITSNNDSVQASDKWNSQNTATKD1TTPLVETK 

PMVEKTLPEQGNYVYSKETEVKNTPSKSAPVAFYAKKGDKVFYDQVFNKDN 

VKWISYKSFGGVRRYAAIESLDPSGGSETKAPTPVTNSGSNNQEKIATQGNYT 

FSHKVEVKNEAKVASPTQFTLDKGDRIFYDQILTIEGNQWLSYKSFNGVRRFV 

LLGKASSVEKTEDKEKVSPQPQARITKTGRLTISNETTTGFDILITNIKDDNGIA 

AVKVPVWTEQGGQDDIKWYTAVTTGDGNYKVAVSFADHKNEKGLYNIHLY 

YQEASGTLVGVTGTKVTVAGTNSSQEPIENGLPKTGVYNIIGSTEVKNEAKISS 

QTQFTLEKGDKINYDQVLTADGYQWISYKSYSGVRRYIPVKKLTTSSEKAKDE 

ATKPTSYPNLPKTGTYTFTKTVDVKSQPKVSSPVEFNFQKGEKIHYDQVLVVD 

GHQWISYKSYSGIRRYIEI* 



Sequence description 

A) Length: 1642 bp - 547 aa (full length gene) 

B) Sequence Characteristics: 
Potential leader peptide sequence 
Orf is preceded by a potential Shine- 
Dalgamo sequence. 

ID-66 



Clone 3-5 

ATGATATTGAGACGTCGAACTATTGTTTTATGGCAACTGGGTATCGCCATT 

TCTCTCATTCTTAGTATTCTAGCCTTAAATCTTTATTTCCATAGTACTCCCTT 

GCAAACCAATGCAGCTTTACGGAACCTTGCTCCTTCATTAAACCATCTTTTT 

GGGACAGATGGTTTAGGTAGGGATATGTTTGTCAGAACGATTAAAGGACT 

TTATTTC TCTCTA CAAGTCGGCTTATTAGGTGCCCTTATGGGGGTCATTCTG 

GCGACAGTTTTTGGAGTGCTTGCAGGTTTAGGAAATAGCATTATTGATAAA 

ATAATAGCATGG TTAG TTGATTTGTTTATTGGTATGCCTCATTTGATTTTTA 

TGATTCTCATTTCTTTTGTTGTTGGGAAAGGTGCTCAAGGGGTCATCATTGC 

AACGGCTGTTACACATTGGCCTTCTTTAGCAAGGCTTATCCGCAATGAAGT 

CTATCATCTAAAGAATAAAGAATTTGTCCAACTTTCTAAAAGTATGGGAAA 

AACGCCTTATTATATTGTGAGGCATCATATCCTGCCTTTGATTGCTTCTCAA 

ATTTTCATTGGTTTTATCCTCTTATTTCCACATGTCATCCTACATGAAGCAT 

CAATGACTTTCTTAGGATTTGGGCTCTCTGCCGAACAACCTTCGGTTGGTA 

TCATTCTGTCAGAGGCAGCTAAGCATATCTCTCTTGGAAATTGGTGGTTGG 

TTATCTTTCCAGGACTTTATCTTATTTTGGTTGTCAATGCATTTGATACTAT 

CGGAGAATCTTTAAAGAAACTCTTTTACCCTCAAACTGATCATTTTTAG 
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MILRRRTIVLWQLGIAISLILSILALNLYFHSTPLQTNAALRNLAPSLNHLFGTD 
GLGRDMFVRTIKGLYFSLQVGLLGALMGVILATVFGVLAGLGNSIIDKIIAWL 
VDLFIGMPHLIFMILISFVVGKGAQGVIIATAVTHWPSLARLIRNEVYHLKNKE 
FVQLSKSMGKTPYYIVRHHILPLIASQIFIGFILLFPHVILHEASMTFLGFGLSAE 
QPSVGIILSEAAKHISLGNWWLVIFPGLYLILVVNAFDTIGESLKKLFYPQTDHF 



Sequence description 

A) Length: 822 bp - 274 aa (full length gene) 

B) Sequence Characteristics: 
Potential leader peptide sequence 
Orf is preceded by a potential Shine- 
Dalgamo sequence. 

ID-78 



Clone 3-5b 

ATGACAGAA ACATT ATTAAGCATTAAAGACCTCTCCATCACCTTCACTCAA 

TACGGAAGATTTTTAAAACCATTTCAATCAACACCGATACAAGCGCTGA 

ATTTAGAAATTAAAAAAGGTGAGTTATTAGCTATTATAGGTGCTAGTGGTT 

CGGGGAAGAGTTTATTAGCACATGCTATTATGGATATTCTTCCTAAAAATG 

CATCTGTAACAGGAGATATGATTTATCGTGGTCAATCACTAAATTCTAAAC 

GCATTAAACAGTTGCGAGGAAAAGATATTACGTTGATTCCACAATCAGTTA 

ATTATTTAGATCCATCTATGAAAGTCAAACATCAGGTGCGCTTAGGTATCT 

CAGAAAATTCAAAGGCTACTCAAGAAGGATTGTTTCAACAGTTTGGTTTAA 

AAGAA AGTGA TGGTGACTTGGATCCTTTCCAACTTTCTGGCGGAATGCTCC 

GACGTGTTTTGTTTACAACGTGTATTAGTGATAAGGTTTCTTTGATTATTGC 

GGATGAGCCCACCCCTGGATTACATCCAGATGCTCTGCAAATGGTTTTAGA 

CCAACTACGCTCCTTTGCAGATAAAGGAATAAGCGTTATATTTATCACTCA 

TGATATTGTAGCAGCTAGTCAAATTGCTGATCGTATTACTATTTTTAAAGA 

GGGAAAAGCTATTGAAACAGCTCCAGCTAGTTTCTTTAGCGGAAATGGAG 

AGCAG TTAC AAACAGAATTTGCTAGAAGTTTATGGCGCTCTCTCCCACAGC 

AAGAATTTTTGAAAGGAGTTACTCATGACCTTAGAGGCTAA 

MTETLLSIKDLSITFTQYGRFLKPFQSTPIQALNLEIKKGELLAIIGASGSGKSLL 

AHAIMDILPKNASVTGDMIYRGQSLNSKRJKQLRGKDITLIPQSVNYLDPSMK 

VKHQVRLGISENSKATQEGLFQQFGLKESDGDLDPFQLSGGMLRRVLFTTCIS 

DKVSLIIADEPTPGLHPDALQMVLDQLRSFADKGISVIFITHDIVAASQIADRITI 

FKEGKAIETAPASFFSGNGEQLQTEFARSLWRSLPQQEFLKGVTHDLRG* 
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Sequence description 

A) Length: 804 bp - 268 aa (full length gene) 

B) Sequence Characteristics: 

No obvious leader peptide sequence 
Orf is preceded by a potential Shine- 
Dalgamo sequence. 

This gene was not isolated using the LEEP 
system. However in determining a full length 
gene sequence for ID-76, this gene was 
identified downstream and fully sequenced. 

ID-79 



Clone 3-5c 

GTCCATCTGGGGTGGTTCCCGATTGGTATTTCTTCTCCGATAGGTACT TTGA 

GTCAAGATATTACGTTAGCTGATCGTATTAAGCACCTTATTTTACCTGTTTT 

CACGGTAAGTATTCTAGGCATTGCCAATGTAACTCTTCATACTAGAACTAA 

AATGATGTCGGTACTTTCTAGTGAATATGTCTTATTTGCCAGAGCGCGTGG 

GGAAACGGAATGGCAAATTTTTAAAAATCATTGTCTTAGAAATGCTATCGT 

ACC AGCTATTAC ACTGC A 1 " 1 1 1 1CCTAFI TJ GGAGAATTGTTTGG AGGATCC 

GTTCTTGCTGAGCAAGTTTTCTCATATCCAGGACTAGGGTCTACCCTAACT 

GAAGCAGGACTTAAAAGTGATACACCGCTACTTCTAGCTATTGTGATGATA 

GGGACATTATTTG 11 II 1 GCGGGCAATCTTATTGCGGATATTTTAAATAGC 

ATAATCAATCCACAGTTAAGGAGAAAAGTATGA 

VHLGWFPIGISSPIGTLSQDITLADRIKHLILPVFTVSILGIANVTLHTRTXMMSV 
LSSEYVLFARARGETEWQIFKNHCLRNAIVPAITLHFSYFGELFGGSVLAEQVF 
SYPGLGSTLTEAGLKSDTPLLLAIVMIGTLFVFAGNLIADILNSIINPQLRRKV* 

Sequence description 

A) Length: 495 bp - 1 65 aa (partial gene sequence) 

B) Sequence Characteristics: 
N-terminus has yet to be determined. 
This gene was not isolated using the LEEP 
system. However in determining a full length 
gene sequence for ID-76, this gene was 
identified upstream. 




1 CONTD 
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ID-80 



Clone 2-17 



TTGCGGACAATTACG7TCAAACACAATGAAACGCGATCGTCAAAAAGCGA 

AGGTAGGGCGGTAATGCTTAAAAGATTATTTACTGAAGATGGGGAATTGA 

CAAAGATTAGTCGTCGTTTCGTTTGGATGTTAGTGGTTATCTATTGTCTTAT 

TATTGTC A GG ATGTGTTTTG GGCCTC AAATTATGATTG AGG GG GTATC AAC 

TC CGAAT GTTCAGCGCTTCGGAAGAATTGTAGCTCTTTTAGTACCATTTAA 

TTCTTTTCGTAGTTTAGATCAGCTAACTAGCTTTAAAGAGATTTTTTGGGTT 

ATTGGTCAAAATGTAGTGAATATTTTACTGCTGTTTCCTCTCATTATAGGGT 

TACTATCCCTAAAGCCAAGTTTACGGAAATATAAAAGCGTTATATTACTTG 

CTTTCTTGATGTCTCTTTTCATAGAGTGTACTCAAGTTGTTTTAGATATTTT 

AATAGATGCTAATCGGGTTTTTGAAATCGACGATCTATGGACAAATACCTT 

AGGCGGTCCTTTCGCCCTATGGAGTTATCGAAACATAAAAGGTTGGCTTCT 

AACTATTAGAAAATGA 

MRTITFKHNETRSSKSEGRAVMLKRLFTEDGELTKISRRFVWMLVVIYCLIIVR 
MCFGPQIMIEGVSTPNVQRFGRIVALLVPFNSFRSLDQLTSFKEIFWVIGQNTVV 
NILLLFPLIIGLLSLKPSLRKYKSVILLAFLMSLFIECTQVVLDILIDANRVFEIDD 
LWTNTLGGPFALWSYRNIKGWLLTIRK* 



Sequence description 

A) Length: 579 bp - 193 aa (full length gene) 

B) Sequence Characteristics: 

Possesses a potential leader peptide sequence 
No obvious Shine-Dalgamo, but the , TTG' codon 
may not be the actual translation start point. 
A methionine (ATG) that occurs ~22 codons 
downstream of the "TTG' is preceded by a 
potential Shine-Dalgamo sequence and may 
represent the actual start codon. 

ID 81 



Clone 3-1 
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TTGAAAAATTTAAATCGTTATGTAGTTGCGGTTTCTGGAGTCGTTTTACATT 

TAATGCTAGGATCAACTTATGCTTGGAGTGTGTTTCGTAACCCAATTATCT 

CAGAGACTGGTTGGGATATTTCATCAGTTTCATTCGCTTTTAGTTTGGCTAT 

TTTTTGTCTAGGAATGTCTGCAGCTTTTATGGGACACTTAGTAGAGCGTTTT 

GGTCCTAGGATAATGGGAATGATTTCTGCTATTTTATATGGAGCAGGGAAT 

GTGTTAAC A G G CTT AG CC ATTG AA ACTC AG C AGTTATGGTTACTGTATGTT 

GCATACGGTATTTTAGGAGGAATCGGACTTGGTTCAGGTTATATTACTCCA 

GTATCGACTATTATTAAATGGTTTCCTGATAGGAGGGGACTAGCAACAGG 

ATTCGCTATTATGGGATTTGGCTTTGCTTCTTTAGTAACAAGTCCGCTTGCA 

CAATCCTT ACTGATT AGGATTGGTGTGGGTAAAACGTTTTATATTTTGGGA 

TTAGTATA 1 i I T M " I GTCATG ATGATTGCCTC AC AATTTATTAAAC AACC AC 

CTCAGGAAAAAATAACTATTTTGACTCACGATGGTAAAAAGAATGCTATG 

AATT CACA AATTATCACTGGATTAAAAGCAAACGTCGCTATAAAATCAAA 

AACCTTTTACATCATTTGGTTGACCTTGTTTATTAATATTTCGTGTGGCTTA 

GGTTTAATATCAGCAGCTTCACCAATGGCACAAGATTTAGCAGGCTATTCC 

GCAGAATCTGCAGCCTTATTAGTAGGGGTACTAGGGATATTTAACGGTTTT 

GGACGTCTGTTATGGGCAAGTCTCTCTGACTACATTGGACGCCCGTTGACC 

TTTATAATATTATTTATTGTGAACTTTATTATGACTTCTAGTTTATTTTTGTC 

ATTCAATG CTATT GTATTTGCAATAGCGATGTCTATTTTAATGACTTGTTAT 

GGTGCAGGTTTTTCCTTATTACCTGCTTATCTAAGTGATATTTTTGGAACAA 

AGGAATTAGCTACTTTACATGGTTATAGTTTAACAGCATGGGCAATAGCAG 

GTCTGTTTGGGCCCCTATTGTTATCAAAGACATATTCATGGGGAAATTCCT 

ATCAATTGACATTAATGGTTTTTGGTTTTTTATTCTTATTCGGATTATTGTTA 

TCTCTATATTTAAGAAAATTAACAACTAAAG1TGTGTAG 

LKNLNRYVVAVSGWLHLMLGSTYAWSVFRNPIISETGWDISSVSFAFSLAIFC 

LGMSAAFMGHLVERFGPRIMGM1SAILYGAGNVLTGLAIETQQLWLLYVAYG 

ILGGIGLGSGYITPVSTIIKWFPDRRGLATGFAIMGFGFASLVTSPLAQSLLIRIG 

VGKTFYILGLVYFFVMMIASQFIKQPPQEKITILTHDGKKNAMNSQIITGLKAN 

VAIKSKTFYIIWLTLFINISCGLGLISAASPMAQDLAGYSAESAALLVGVLGIFN 

GFGRLLWASLSDYIGRPLTFIILFIVNFIMTSSLFLSFNAIVFAIAMSILMTCYGA 

GFSLLPAYLSDIFGTKELATLHGYSLTAWAIAGLFGPLLLSKTYSWGNSYQLTL 

MVFGFLFLFGLLLSLYLRKLTTKVV* 



Sequence description: 

A] Length 1221 bp - 407 a.a (full length 
gene). 

B] TTG start codon with Shine-Dalgamo 
sequence upstream. Obvious signal peptide, 
with hydropathy plot exhibiting many possible 
membrane spanning regions, indicating protein 
to be transmembrane. 



FIG. 1 CONT'D 
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ID-82 



Clone 48 

ATGGCAGATAAAAACAGAACATTTAAACTTGTAGGTGCAGGATCTTCTAG 

CACACAAGAAAAAATTGAAAAGCCTGCTCTTTCGTTTATGCAAGATGCGTG 

GCGTCGCTTGAAAAAAAACAAATTAGCAGTAGTTTCACTCTATTTATTAGC 

TCTT1 IACTTAC1 1 1 Tl CGTTAGCCTCAAATTTATTTGTAACTCAGAAGGAT 

GCTAATGGGTTTGATTCGAAAAAAGTAACGACATATCGCAACTTACCACCT 

AAATTGAGTTCAAACCTTCCTTTTTGGAATGGTAGCATTAATCCATCA 

MADKNRTFKLVGAGSSSTQEKIEKPALSFMQDAWRRLKKNKLAVVSLYLLA 
LLLTFSLASNLFVTQKDANGFDSKKVTTYRNLPPKLSSNLPFWNGSrNPS 



Sequence description: 

A] Current length is 303 bp - 101 aa 

B] No obvious signal peptide but Shine 
Dalgamo sequence upstream of the ATG start 

codon. Not ide3ntified directly using the LEEP system but was found 
directly downstream of ID-34 described in WO 00/06736. 



ID-83 



Clone 98 



ATGAAAATAGTAGTACCAGTAATGCCTCGCAGTCTTGAAGAGGCTCAAGA 
AATAGATTTATCAAAATTTGATAGTGTTGATATTATTGAATGGCGAGCTGA 
TG CCTTAC C AAAGG ATG AC ATTATTAATGTAG CTCC AG CTATTTTTG AG AA 
ATTCGC AGGTC ATG AAATTA HT11 ACTTTTCGTAC AACGCGTG AAGGTGG 
TAATATTGTCTTATCTGATGCTGAGTATGTTGAGTTAATCCAGAAAATTAA 
TTCTATCTACAATCCAGATTATATTGATTTTGAGTA I 11 1 1 CAC ATAAAGAA 
GTT1 1 1 CAAGAAATGCTAGAATTTCCAAATTTAGTCCTGTCTTATCACAATT 
TTC AAG AG AC ACC GG AG AATATTATG G AG ATATTTTC A G AATTAAC AGCC 
CTAGCACCACGAGTTGTGAAAATCGCAGTAATGCCAAAGAATGAACAAGA 
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TGTCTTAGACGTTATGAATTACACTCGCGGTTTCAAGACTATTAATCC TGA 
TCAAGTTTATGCGACGGTATCTATGAGTAAAATTGGACGTATTTCTCGTTTT 
G CTG GTG ATGT AACTG G ATCT AGTTG G AC ATTTG C AT ATTT AG ATTC ATCT 
ATCGCACCCGGACAAATTACTATTTCAGAGATGAAGCGTGTCAAAGCATT 

GCTTGACGCTGACTGA 

MKIVVPVMPRSLEEAQEIDLSKFDSVDIIEWRADALPKDDIINVAPAIFEKFAG 
HEIIFTTOTTREGGNIVLSDAEYVELIQKINSIYNPDYIDFEYFSHKEVFQEMLEF 
PNLVLSYHNFQETPENIME1FSELTALAPRVVKIAVMPKNEQDVLDVMNYTRG 
FKTINPDQVYATVSMSKIGRISRFAGDVTGSSWTFAYLDSSIAPGQ1TISEMKRV 

KALLDAD* 



Sequence description: 

A] Length 678 bp, 225 aa (full length gene) 

B] No obvious signal peptide, but there is a 
Shine Dalgarno immediately upstream of ORF. 



ID-84 



Clone RS-52 



ATGAAAGACTTATTTGCAACAACAGAAGCATCATCAAGGAAACAGGAACA 

AGATAGAATTGTCAATTACATAAAACAACATGTTGAGTTAACAAATGGTA 

ATCAAATAAAAAAAATTGAGTTTATCGACTTTCAAA AAAA TGAGATGACA 

GGTACATGGGGAATTTCTACTAAAATTAATGAACAATTTTCGATTAGTTTT 

TCTGAAGATAGAATTGGTGGTAAACTTAGAGCATTAGGATATCAACCGAA 

TGAAATAGGTTTTTCAAAGGACATCAATAGTAATAATCAAAATGTTAATGA 

TATTGAAGTGATTTATATGAAGAAAGAATAG 

MKJ)LFATTEASSRKQEQDRIVNYIKQHVELTNGNQIKKIEFIDFQKNEMTGTW 
GISTKINEQFSISFSEDRJGGKLRALGYQPNEIGFSKDINSNNQNVNDIEVIYMK 

KE* 

Sequence description: 

A] length: 333 bp - 11 1 aa (partial sequence) 

B] No obvious Shine Dalgarno sequence upstream 
of the ATG start codon, and no obvious signal 
peptide within the protein. 



FIG. 1 CONTD 
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ID-85 

Clone RS-53 



ATGAAAAAACGTATATGGTATTTGATAATAATAATCACAGTAATTTTAGGA 

GGACTAGCCATGAAAAACTTATTTGCAACAACAGAAGCATCATCAAGGAA 

ACAGGAACAAGATAGAATTGTCAATTACATAAAACAACATGTTGAGTTAA 

CAAATGGTAATCAAATAAAAAAAATTGAGTTTATCGACTTTCAAAAAAAT 

G AG ATG AC AGGT AC ATG G G G AATTTCTACTAAAATT AATG A AC AATTTTC G 

ATTAG 1 TIT I CTG AAG ATAG AATTGGTGGTAAACTTAG AGC ATTAGG ATAT 

CAACCGAATGAAATAGGTTTTTCAAAGGACATCAATAGTAATAATCA 

MKKJUWYL1I1ITVILGGLAMKNLFATTEASSRKQEQDRIVNYIKQHVELTNGN 

QIKKIEFIDFQKNEMTGTWGISTKINEQFSISFSEDRIGGKLRALGYQPNEIGFSK 

DINSNNQ 



Sequence description: 

A] Length: 351 bp - 117 aa (Partial sequence) 

B] Obvious signal peptide and Shine Dalgamo 
sequence upstream of the ATG start codon. 



ID-86 

Clone ID-74 

ATGTCAAATCAATATGATTATATCGTTATTGGTGGAGGTAGT 

GCAGGCAGTGGTACCGCTAATAGGGCAGCCATGTATGGAGC 

AAAAGTCCTGTTAATTGAAGGTGGACAAGTAGGTGGAACTTG 

TGTTAACTTAGGTTGTGTACCTAAGAAAATCATGTGGTATGG 

TGCACAAGTTTCTGAGACACTCCATAAGTATAGTTCAGGTTA 

TGGTTTTGAAGCCAATAATCTTAGTTTTGATTTTACTACTCTA 

AAAGCTAATCGCGATGCTTACGTGCAGCGGTCTAGACAGTCG 

TATGCCGCTAATTTTGAGCGTAATGGGGTCGAAAAGATTGAT 

GGATTTGCTCGTTTTATTGATAACCATACTATTGAAGTGAATG 

GTCAGCAATATAAAGCTCCTCACATTACTATTGCAACAGGTG 



FIG. 1 CONTD 
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GACACCCTCTTTACCCTGATATTATTGGAAGTGAACTTGGTG 

AGACTTCTGATGATTTTTTTGGATGGGAGACCTTACCAAATTC 

TATATTGATTGTTGGGGCGGGCTATATCGCGGCAGAACTTGC 

TGGAGTGGTTAATGAATTAGGCGTTGAAACCCATCTTGCATT 

TAGAAAAGACCATATTCTACGCGGATTTGATGACATGGTAAC 

AAGTGAGGTTATGGCTGAAATGGAGAAATCAGGTATCTCTTT 

ACATGCTAACCATGTACCTAAATCTCTTAAACGCGATGAAGG 

TGGCAAGTTGATTTTTGAAGCTGAAAATGGGAAAACGCTTGT 

CGTTGATCGTGTAATATGGGCTATCGGCC GTGG ACCAAATGT 

AGACATGGGACTTGAAAATACCGATATTGTTTTAAATGATAA 

AGATTATATCAAAACAGATGAATTTGAGAATACTTCTGTAGA 

TGGCGTGTATGCTATTGGAGATGTTAATGGGAAAATTGCCTT 

GACACCGGTAGCAATTGCAGCAGGTCGTCGCTTATCAGAAAG 

ACTTTTTAATCATAAAGATAACGAAAAATTAGATTACCATAA 

TGTACCTTC AGTTA 1 Tl T 1 ' ACTC ACCCTGTAATTGG G ACG GT A 

GGACTTTCAGAAGCAGCAGCTATCGAGCAATTTGGAAAAGAT 

AATATCAAAGTCTATACATCAACTTTTACCTCTATGTATACGG 

CTGTTACCAGTAATCGCCAAGCAGTTAAGATGAAGCTCATAA 

CCCTAGG AAAAG AGG AAA AAGTTATTGGG CTTCATG GTGTTG 

GTTATGGTATTGATGAAATGATTCAAGGTTTTTCAGTTGCTAT 

CAAAATGGGGGCTACTAAAGCAGACTTTGATGATACTGTTGC 

TATTCACCCAACTGGATCTGAGGAATTTGTTACAATGCGCTA 

A 

MSNQYDYIV1GGGSAGSGTANRAAMYGAKVLLIEGGQVGGTC 

VNLGCVPKKIMWYGAQVSETLHKYSSGYGFEANNLSFDFTTLK 

ANRDAYVQRSRQSYAANFERNGVEKIDGFARFIDNHTIEVNGQ 

QYKAPHITIATGGHPLYPDIIGSELGETSDDFFGWETLPNSILIVG 

AGYIAAELAGVVNELGVETHLAFRKDHILRGFDDMVTSEVMAE 

MEKSGISLHANHVPKSLKRDEGGKLIFEAENGKTLVVDRVIWAl 

GRGPNVDMGLENTDIVLNDKDYIKTDEFENTSVDGVYAIGDVN 

GKIALTPVAlAAGRRLSERLFNHKDNEKLDYFrNVPSVIFTHPVIG 

TVGLSEAAAIEQFGKDNIKVYTSTFTSMYTAVTSNRQAVKMKLI 

TLGKEEKVIGLHGVGYGIDEMIQGFSVAIKMGATKADFDDTVAI 

HPTGSEEFVTMR* 



ID-87 



FIG. 1 CONTD 
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Clone RS-55 



ATGACAAAAAAACATCTTAAAACGCTTGCCTTGGCACTTACTACAGTATCA 

GTAGTGACATACAGCCAGGAGGTATATGGATTAGAAAGAGAGGAATCGGT 

CAAACAAGAACAAACCCAGTCAGCTTCAGAAGATGATTGGTTCGAAGAAG 

ATAATGAGAGGAAAACAAATGTTTCTAAAGAGAATTCTACTGTTGATGAA 

ACAGTTAGTGATTTATTTTCTGATGGAAATAGTAATAACTCTAGTTCTAAA 

ACCGAGTCAGTGGTAAGTGACCCTAAACAAGTCCCCAAAGCAAAACCAGA 

GGTTACACAAGAAGCAAGCAATTCTAGTAATGATGCTAGCAAAGTAGAAG 

TACCAAAACAGGATACAGCTTCAAAAAAGGAAACTCTAGAAACATCAACT 

TGGGAGGCAAAAGATTTCGTAACTAGAGGGGATACTTTAGTAGG 11111 CA 

AA ATCTG G AATT AATAAGTT ATCTC AAAC ATC AC ACTTG G 11 1TACCAAGT 

CATGCAGCAGATGGAACTCAATTGACACAAGTAGCTAGCTTTGCTTTTACT 

CCAGATAAAAAGACGGCCATTGCAGAATATACAAGTAGGCTAGGAGAAA 

ATGGGAAACCGAGTCGTTTAGATATTGATCAGAAGGAAATTATTGATGAG 

GGAGAAATATTTAATGCTTACCAGTTGACTAAGCTTACTATTCCAAATGGT 

TATAAGTCTATTGGTCAAGATGCTTTTGTGGACAATAAGAATATTGCTGAG 

GTTAACCTTCCTGAGAGTCTCGAGACTATTTCAGACTATGCTTTTGCTCACA 

TGTCTTTAAAACAAGTAAAGTTACCAGATAACCTAAAGGTCATTGGAGAA 

TTAGC1 TTTTTI G ATA ATC AG ATTGGTGGTAAGCTTTACTTGCCACGTC ACT 

TGATAAAATTAGCAGAACGCGCTTTCAAATCTAATCGTATTCAAACAGTTG 

AATTTTTGGGAAGTAAGCTTAAGGTTATAGGAGAAGCAAGTTTTCAAGAT 

AATAATCTGAGGAATGTTATGCTTCCGGATGGACTTGAAAAAATAGAATC 

AGAAGCTTTTACAGGAAATCCAGGAGATGAACATTACAACAATCAGGTTG 

TATTGCGCACAAGGACAGGCCAAAATCCACATCAACTTGCGACTGAGAAT 

ACTTACGTCAATCCGGACAAATCATTGTGGCGTGCAACACCTGATATGGAT 

TATACCAAATGGTTAGAGGAAGATTTTACCTATCAAAAAAATAGTGTTACA 

GGTTTTTCAAATAAAGGCTTACAAAAGGTAAGACGTAATAAAAACTTAGA 

AATTCCAAAACAACACAATGGTATTACTATTACTGAAATTGGTGATAACGC 

TTTTCGCAATGTTGATTTTCAAAGTAAAACTTTACGTAAATATGATTTGGA 

AGAAATAAAGCTCCCCTCAACTATTCGGAAAATAGGTGCTTTTGCTTTTCA 

ATCTAATAACTTGAAATCCTTTGAAGCAAGTGAAGATTTAGAAGAGATTA 

AAGAGGGAGCCTTTATGAATAATCGTATTGGAACTCTAGACTTGAAAGAC 

AAACTTATCAAAATAGGTGATGCTGCTTTCCATATTAATCATATTTATGCC 

ATTGTTCTTCCAGAATCTGTACAAGAAATAGGACGTTCAGC 1 111 CGACAA 

AATG GTGCG CTTC ACCTTATGTTTATC GG AAATAAG GTTAAA AC AATTGGT 

GAAATGGCTTTTTTATCCAATAAACTGGAAAGTGTAAATCTCTCTGAGCAA 

AAAC AATTAAAGACAATTGAGGTCCAAGC 1 1 1 111 CGGATA ATGCCCTTAGT 

GAAGTAGTCTTACCGCCAAATTTACAGACTATTCGTGAAGAGGCTTTCAAA 

AGGAATCATTTGAAAGAAGTGAAGGGTTCATCTACATTATCTCAGATTACT 

TTTAATGCTTTTGATCAAAATGATGGGGACAAACGCTTTGGTAAGAAAGTG 

GTTGTTAGGACACATAATAATTCTCATATGTTAGCAGATGGTGAGCGTTTT 

ATCATTGATCCAGATAAGCTATCTTCTACAATGGTAGACCTTGAAAAGGTT 
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TTAAAAATAATCGAAGGTTTAGATTACTCTACATTACGTCAGACTACTCAA 

ACTCAGTTTAGAGAAATGACTACTGCAGGTAAAGCGTTGTTATCAAAATCT 

AACCTCCGACAAGGAGAAAAACAAAAATTCCTTCAAGAAGCACAATTTTT 

CCTrGGTCGCGTTGATTTGGATAAAGCCATAGCTAAAGCTGAGAAGGCTTT 

AGTGACCAAGAAGGCAACAAAGAATGGTCATTTGCTTGAGAGGAGTATTA 

ACAAAGCGGTATTAGCTTATAATAATAGTGCTATTAAAAAAGCTAATGTTA 

AGCGCTTGGAAAAAGAGTTAGACTTGCTGACAGATTTAGTCGAGGGAAAA 

G GACC ATTAG CGC AAGCTAC AATG GTAC AAGGAGTTTATTTATT AAAG AC 

GCCTTTACCATTGCCAGAATATTATATCGGATTGAACGTTTATTTTGACAA 

GTCTGGAAAATTGATTTATGCACTTGATATGAGTGATACTATTGGCGAGGG 

ACAAAAAGATGCATATGGTAATCCTATATTAAATGTTGACGAGGATAATG 

AAGGTTATCATACCTTGGCAGTTGCCACTTTAGCTGATTATGAAGGTCTTT 

ATATTAAAGATATTTTAAATAGTTCCCTTGATAAGATTAAAGCAATACGCC 

AGATTCCTTTGGCAAAATATCATAGATTAGGAATTTTCCAAGCTATCCGAA 

ATGCAGCGGCAGAAGCAGACCGATTGCTTCCTAAGACACCTAAGGGGTAC 

CTAAATGAAGTCCCAAATTATCGTAAAAAACAAATGGAGAAAAATTTAAA 

ACCAGTTGATTATAAAACGCCGAl 1 11 1 AATAAGGCTTTACCTAATGAAAA 

GGTAGACGGTGATAGAGCGGCTAAAGGTCATAATATAAATGCGGAGACTA 

ATAATTCTGTAGCTGTAACACCAATAAGGTCCGAGCAGCAATTACATAAGT 

CACAGTCTGATGTAAATTTACCTCAAACAAGTTCTAAAAATAATTTTATAT 

ACGAGATTCTAGGATACGTTAGTTTATGTTTGCTTTTCCTAGTAACTGCTGG 

GAAAAAAGGAAAACGAGCAAGAAAATAA 

MTKKHLKTLALALTTVSVVTYSQEVYGLEREESVKQEQTQSASEDDWFEEDN 

ERKTNVSKENSTVDETVSDLFSDGNSNNSSSKTESVVSDPKQVPKAKPEVTQE 

ASNSSNDASKVEVPKQDTASKKETLETSTWEAKDFVTRGDTLVGFSKSGINKL 

SQTSHLVLPSHAADGTQLTQVASFAFTPDKKTAIAEYTSRLGENGKPSRLDIDQ 

KEIIDEGEIFNAYQLTKLTIPNGYKSIGQDAFVDNKNIAEVNLPESLETISDYAF 

AHMSLKQVKLPDNLKVIGELAFFDNQIGGK.LYLPRHLIKLAERAFKSNRIQTV 

EFLGSKLKVIGEASFQDNNLRNVMLPDGLEKJESEAFTGNPGDEHYNNQVVLR 

TRTGQNPHQLATENTYVNPDKSLWRATPDMDYTKWLEEDFTYQKNSVTGFS 

NKGLQKVRRNKNLEIPKQHNGITITEIGDNAFRNVDFQSKTLRKYDLEEIKLPS 

TIRKIGAFAFQSNNLKSFEASEDLEEIKEGAFMNNRIGTLDLKDKLIKIGDAAFH 

INHIYAIVLPESVQEIGRSAFRQNGALHLMFIGNKVKTIGEMAFLSNKLESVNL 

SEQKQLKTIEVQAFSDNALSEVVLPPNLQTIREEAFKRNHLKEVKGSSTLSQITF 

NAFDQNDGDKJ^GKKVWRTHNNSHMLADGERFIIDPDKLSSTMVDLEKVL 

KIIEGLDYSTLRQTTQTQFREMTTAGKALLSKSNLRQGEKQKFLQEAQFFLGR 

VDLDKAIAKAEKALVTKKATKNGHLLERSINKAVLAYNNSAIKKANVKRLEK 

ELDLLTDLVEGKGPLAQATMVQGVYLLKTPLPLPEYYIGLNVYFDKSGKLIYA 

LDMSDTIGEGQKDAYGNPILNVDEDNEGYHTLAVATLADYEGLYIKDILNSSL 

DKJKAIRQIPLAKYHRLGIFQA1RNAAAEADRLLPKTPKGYLNEVPNYRKKQM 

EKNLKPVDYKTPIFNKALPNEKVDGDRAAKGHNINAETNNSVAVTPIRSEQQL 

HKSQSDVNLPQTSSKNNFIYEILGYVSLCLLFLVTAGKKGKRARK* 
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Sequence description: 

A] Length 3168 bp - 1056 aa (Partial sequence) 

B] Obvious signal peptide with Shine Dalgamo 
sequence upstream of the ATG start codon. 



ID-88 



Clone RS-56 



GCAGGATACATCATGCACAAGCACGAGGCTATCGTGTCATGCTGGGGTCA 
ACCCAGGAAGACATGTCGGCACAAGCTGAAGATTTCTTTACAGTCTGTACA 
CAATAAAGAGACGGGTAAGAGCGCTTTTAATGACAAAGAACGACTAGCAA 
TT 

AGYIMHKHEAIVSCWGQPRKTCRHKLKISLQSVHNKETGKSAFNDKERLAI 



Sequence description: 

A] Length: 153 bp - 51 aa (partial sequence) 

B] No signal peptide visible, insufficient 
sequence data to determine the presence of a 
Shine Dalgamo sequence. 



ID-89 



Clone RS-58 

GTGTCATTTATGCAAAGAAAATCCTATTTAAAATCCATGAGTGTTCTTACT 

TTAACAGCTTGTCTTATATCAGGATATGTGGTTAAAGATATTGCTATGTTA 

CATGCAGTATCTGCCAGTGAGAAGAAAGCAAATAATGTCAGTCCGAGAGA 

AAATCTCTACAGGGCTGTCAATGATAATTGGCTAGCCAATACAAAACTCA 

AACAAGGGCAGACTAGTGTTAATAGTTTTTCAGAAATTGAGGATAAATTA 

AAGCAACTGTTAGTGTCTGATATGGCTAAAATGGCCTCAGGAAAGATTGA 
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AACAACCAATGATGAACAGAAAAAAATGGTTGCATACTATAAACAAGGTA 

TGGACTTTAAAACAAGAGATAAAAATGGTCTCAAACCTCTAAAACCAGTT 

TTACAAAAACTTGAAGCAGTCTCTTCAATGAAAGACTTTCAAAGTTTGGCC 

CATGATTTTGTGATGAGTGGTTTTGTTTTACCATTTGGTTTGACTGTGGAAA 

CCAATGCTCGAGATAATAGCCAAAAGCAATTGGTGCTTCGTCAAGCACCC 

GCATTACTTGAATCACCTGACCAATATAAGAAGGGCAATAAAGAAGGTGA 

GGCTAAATTATCAGCTTACCGTACTTCAGCAATGGCTTTGCTTAAACAAGC 

TGGAAAAAGTAACATTGAAGATAGAAAACTAGTTAAACAAGCTATAGCAT 

TTGATAGACTCTTATCAGAAAAAACGCAAGTTGATCAAAGTAAAATCACA 

GCTGAAAGTGAGACAGCTGCGGGGCGATATAACCCTGAAAGTATGGAAAC 

GGTTCACAATTACGCCAAGGAATTTGACTTTAAAGAATTGATTGAAAAACT 

AGTTGGGCCAACGAATAAGGCAGTCAATGTAGAAGATAAAACTTATTTTA 

AACAGGTTAATGATGTTATAAATAGTAAACAATTAGCCAATATGAAAGCA 

TGGATGATGATTTCTATGCTAGTTGATCAATCAGATTTTCTAGGAGAACAA 

AATCGTCAAGCAGCGAGTGCTTTTAAGAATGTTGCGTCTGGTTTGACTCAG 

ATTG AATCG AAAG AA AAAATG CTTAC ACCC AATTAG 



MSFMQRKSYLKSMSVLTLTACLISGYVVKDIAMLHAVSASEKKANNVSPREN 

LYRAVNDNWLANTKLKQGQTS\^SFSEIEDKLKQLLVSDMAKMASGKIETTN 

DEQKKMVAYYKQGMDFKTRDKNGLKPLKPVLQKLEAVSSMKDFQSLAHDF 

VMSGFVLPFGLTVETNARDNSQKQLVLRQAPALLESPDQYKKGNKEGEAKLS 

AYRTSAMALLKQAGKSNIEDRKLVKQAIAFDRLLSEKTQVDQSKITAESETAA 

GRYNPESMETVHNYAKEFDFKELIEKLVGPTNKAVNVEDKTYFKQVNDVINS 

KQLANMKAWMMISMLVDQSDFLGEQNRQAASAFKNVASGLTQIESKEKMLT 

PN* 



Sequence description: 

A] Length: 1095 bp - 365 aa (full length gene) 

B) an GTG (possible ATG start codon located 7 bp 
further downstream) start codon with an obvious 
signal peptide. Shine Dalgamo sequence present 
upstream of the ORF. 



ID-90 



Clone RS-59 



FIG. 1 CONTD 
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ATG G AAATGCCT AAAAG AAATG AATTA CTC AAT A AA G A AATTAAA ATG A G 
TATTGATAAACTTAGATATAAAGAACCAGAGAGTGAACATGACAAGCGAC 
CT ACTTTTT ATTTG GT AGTACTT ATACTTGTT ACTGTAG C AGTTAT ATTGTC 
GTTATTTAAATA1 11 i 1 1 ATAG 

MEMPKRNELLNKEIKMSIDKLRYKEPESEHDKRPTFYLVVLILVTVAVILSLFK 
YFL* 



Sequence description: 

A] Length: 174 bp - 58 aa(full length gene) 

B] No obvious signal peptide, but Shine 
Dalgarno sequence is present upstream of ATG 
start codon. 



ID-91 



Clone RS-62 (partial sequence) 



ATGCAGGTATTTTTAAATATTGTCAATAAATTCTTTGATCCAGTTATTCATA 
TGGGTTCGGGAGTTGTGATGCTAATTGTCATGACAGGTTTAGCCATGATAT 
TTGGAGTGAAGTTTTCTAAAGCACTTGAAGGTGGTAT 

MQVFLNIVNKFFDPV1HMGSGVVMLIVMTGLAMIFGVKFSKALEGG 



Sequence description: 

A] Length: 141 bp - 41 aa (partial sequence 

B] Shine Dalgamo sequence present upstream of 
ATG start codon with a possible signal peptide 
present 



ID-92 
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Clone RS-69 (partial sequence) 



ATGAAAAAGAAAACATTCAGTGCTTATAACTTTTTAACGGCTCTTATCCTT 
TGTCTTTTG AC AGTGC 1 TTTI ATCTTTCC ATTTTATTGG ATTATG AC AGG AG 
CTTTTAA 

MKKKTFSAYNFLTALILCLLTVLFIFPFYWIMTGAF 



Sequence description: 

A] Length: 1 10 bp -36 aa (Partial sequence) 

B] Possible signal peptide with Shine Dalgamo 
sequence directly upstream of the ATG start 
codon. 



ID-93 

Clone RS-70 



ATGACTGAGAACTGGTTACATACTAAAGATGGTTCAGATATTTATTATCGT 

GTCGTTGGTCAAGGTCAACCGATTGTTTTTTTACATGGCAATAGCTTAAGT 

AGTCGCTATTTTGATAAGCAAATAGCATATTTTTCTAAGTATTACCAAGTT 

ATTGTTATGGATAGTAGAGGGCATGGCAAAAGTCATGCAAAGCTAAATAC 

CATTAGTTTCAGGCAAATAGCAGTTGACTTAAAGGATATCTTAGTTCATTT 

AGAGATTGATAAAGTTATATTGGTAGGCCATAGCGATGGTGCTAATTTAGC 

tttag rrm C AAACG ATGTTTCC AG AT ATG GTT AG AG GG CTTTTG CTT AAT 

TCAGGGAACCTGACTATTCATGGTCAGCGATGGTGGGATATTCTTTTAGTA 

AGG ATTGC CTATA AATTCCTTC ACTATTTAG G G AAACTCTTTC C GTAT ATG 

AGGCAAAAAGCTCAAGTTATTTCGCTTATGTTGGAGGATTTGAAGATTAGT 

CCAGCTGATTTACAGCATGTGTCAACTCCTGTAATGGTTTTGGTTGGAAAT 

AAGGACATAATTAAGTTAAATCATTCTAAGAAACTTGCTTCTTATTTTCCA 

AGGGGGGAGTTTTATTCTTTAGTTGGCTTTGGGCATCACATTATTAAGCAA 

G ATTCCC ATG rTTTl AATATTATTGC AA AAAAGTTTATC AACG ATACGTTG 

AAAGGAGAAATTGTTGAAAAAGCTAATTGA 

MTENWLHTKDGSDIYYRVVGQGQPIVFLHGNSLSSRYFDKQIAYFSKYYQVIV 
MDSRGHGKSHAKLNTISFRQIAVDLKDILVHLEIDKVILVGHSDGANLALVFQ 
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TMFPDMVRGLLLNSGNLTIHGQRWWDILLVRIAYKFLHYLGKLFPYMRQKA 

QVISLMLEDLKISPADLQHVSTPVMVLVGNKDIIKLNHSKKLASYFPRGEFYSL 

VGFGHHIIKQDSHVFNIIAKKFINDTLKGEIVEKAN* 



Sequence description: 

A] Length: 744 bp - 248 aa (full length gene) 

B] No obvious signal peptide, but Shine 
Dalgamo sequence upstream of the ATG start 
codon. 



ID-94 



Clone RS-71 



ATGGTAGCAAAAGAGTTAGGTAAAAATAGCTTTACTATCCCAACTATTTGT 

TCTAATTGCTCCGCAGGTACTGCCATTGCAGTTGTATATAATGATGACCAT 

TCTTTCTTAAGATACGGCTATCCCGAGTCTCCACTTCATATTTTTATCAATA 

CACGGATCATTGCACAGGCACCAAGCAAATATTTTTGGGCTGGTATTGGGG 

ACGGTATTTCAAAAGCCCCTGAAGTAGAACGTGCTACCTTAGAGGCTAAG 

ACCAATAAACTACCACATACTGCAGTGTTAGGACAAGCAGTCGCTCTGTCT 

TCAAAGGAAGCTTTTTATCAATTTGGTGAACAAGGTCTAAAAGACGTTGAA 

GCTAATTTAGCTTCGCGTGCAGTTGAAGAAATTGCGCTTGATATCTTA 

MVAKELGKNSFTIPTICSNCSAGTAIAVVYNDDHSFLRYGYPESPLHIFINTPJIA 

QAPSKYFWAGIGDGISKAPEVERATLEAKTNKLPHTAVLGQAVALSSKEAFY 

QFGEQGLKDVEANLASRAVEEIALDIL 



Sequence description: 

A] Length: 405 bp - 135 aa (Partial sequence) 

B] No obvious Shine Dalgamo sequence upstream 
of the ATG start codon, probable signal 
peptide present at the N-terminus. 

ID-95 
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Clone RS-73 



TTGAGGGAAACTTACTGGAAAATTTCAAGCGATTGCGATAAAATAAATCTT 

GCAGAGTTTTCTAGAGAAAGGAGGTCAGATTTATTGGAGTGGCAAGATCT 

AGCGCAGTTACCTGTATCTATTTTTAAAGACTATGTTACAGATGCTCAAGA 

CGCGGAAAAACCTTTTATATGGACAGAAGTATTTTTAAGGGAGATTAATCG 

CTCAAATCAAGAAATTATTTTGCATATTTGGCCGATGACTAAGACAGTCAT 

TCTGGGGATGTTAGATCGAGAATTACCACATTTAGAATTAGCTAAAAAAG 

AAATCATCAGTCGTGGTTATGAACCAGTTGTTCGGAATTTTGGAGGTCTCG 

CAGTTGTAGCTG ATG AAGG AATTTTAAA 1 " 1 " 1 TIC ATTGGTTATTCC AG ATGT 

TTTTGAGAGAAAATTGTCTATCTCAGATGGGTATCTTATAATGGTCGATTTT 

ATTAGAAGTATATTTTCGG A ITTT1 ATCAACCTATTGAGC ACTTTG AAGTA 

GAGACCTCCTATTGTCCTGGTAAGTTTGATCTTAGTATAAATGGCAAAAAA 

TTTGCTGGCTTGGCTCAGCGCCGTATAAAGAATGGTATTGCGGTATCAATT 

TACCTTAGCGTTTGTGGCGATCAAAAAGGGCGGAGTCAAATGATTTCAGAT 

TTTTATAAGATTGGTCTAGGTGATACGGGTAGTCCAATTGCTTATCCAAAT 

GTAGATCCTGAAATTATGGCTAATCTATCTGATCTATTAGATTGTCCTATG 

ACAGTAGAAGATGTTATTGATCGTATGTTGATTAGCCTTAAACAAGTAGGT 

TTTAATGATCGTTTACTGATGATTAGACCCGATTTAGTTGCAGAGTTTGAT 

AGATTTCAGGCTAAGTCTATGGCTAATAAGGGGATGGTGAGCAGAGATGA 

ATAA 



MRETYWKISSDCDKINLAEFSRERRSDLLEWQDLAQLPVSIFKDYVTDAQDAE 

KPFIWTEVFLREINRSNQEIILHIWPMTKTVILGMLDRELPHLELAKKEIISRGYE 

PVVRNFGGLAVVADEGILNFSLVIPDVFERKLSISDGYLIMVDFIRSIFSDFYQPI 

EHFEVETSYCPGKFDLSINGKKFAGLAQRRIKNGIAVSIYLSVCGDQKGRSQMI 

SDFYKIGLGDTGSPIAYPNVDPEIMANLSDLLDCPMTVEDVIDRMLISLKQVGF 

NDRLLMIRPDLVAEFDRFQAKSMANKGMVSRDE* 



Sequence description: 

A] Length: 921 bp -307 aa (Full-length gene sequence) 

B] No obvious Shine Dalgarno sequence upstream 
of the TTG start codon or signal peptide 
visible. Actual start point may be a further 

85 bp downstream (TTG). This start point is 
preceded by a typical Shine-Dalgarno sequence. 
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ID-96 



Clone RS-74 



TTGGAAGGTTTACTTATTGCATTGATTCCCATGTTTGCGTGGGAAAGTATT 

GGATTTGTTAGTAATAAAATTGGAGGGCGTCCAAATCAACAAACATTTGG 

AATGACTTTAGGAGCATTGCTATTTGCGATTATCGTATGGTTATTTAAACA 

GCCAGAGATGACTGCCTCATTGTGGATTTTTGGTATCTTAGGTGGTATCCT 

ATGGTCAGTCGGCCAAAATGGTCAATTTCAAGCAATGAAATATATGGGAG 

TCTCTGTTGCTAATCCACTGTCAAGTGGTGCACAATTAGTAGGTGGAAGCC 

TAGTTGGTGCTTTAGTCTTTCATGAATGGACTAAGCCAATCCAATTTATTTT 

AGGATTGACAGCGTTGACATTATTAGTTATCGGCTTCTATTTGTCAAGTAA 

ACGTGATGTTTCAGAACAAGCTTTGGCAACACATCAAGAGTTTTCAAAAG 

GATTTGCTACAATTGCTTATTCAACTGTAGGTTACATCTCGTACGCAGTTTT 

ATTTAACAACATTATGAAGTTCGACGCTATGGCCGTCATTTTACCCATGGC 

TGTTGGAATGTGTCTAGGTGCAATTTGTTTCATGAAGTTTCGTGTTAACTTT 

GAGGCTGTTGTTGTTAAAAATATGATTACAGGTCTCATGTGGGGCGTTGGT 

AATGTCTTCATGTTATTGGCAGCAGCTAAAGCAGGGCTAGCAATTGCTTTT 

AGTTTTTCTC AACTTG G AGT AATT ATCTCT ATTATTG GTG GTATTTT ATTTTT 

AGGTGAGACAAAAACGAAGAAAGAGCAGAAATGGGTTGTCATGGGTATC 

CTTTGTTTTGTTATGGGTGCTATATTACTTGGTATTGTTAAATCTTATTAA 

MEGLLIAL1PMFAWESIGFVSNKIGGRPNQQTFGMTLGALLFA1IVWLFKQPEM 

TASLWIFGILGGILWSVGQNGQFQAMKYMGVSVANPLSSGAQLVGGSLVGAL 

VFHEWTKPIQFILGLTALTLLVIGFYFSSKPvDVSEQALATHQEFSKGFATIAYST 

VGYISYAVLFNNIMKFDAMAVILPMAVGMCLGAICFMKFRVNFEAVVVKNMI 

TGLMWGVGNVFMLLAAAKAGLAIAFSFSQLGVIISIIGGILFLGETKTKKEQK 

WVVMGILCFVMGAILLGIVKSY* 



Sequence description: 

A] Length: 867 bp - 289 aa (full-length gene) 

B] Posible Shine Dalgamo sequence upstream of 
GTG start codon, no obvious signal peptide 
present. 



ID-97 
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Clone RS-75 



ATGACAACTTACTACGAAGCTATAAACTGGAACGAAATTGAAGATGTTAT 

TGATAAATCAACTTGGGAAAAACTAACCGAACAATTTTGGCTCGATACAC 

GTATCCCTTTATCAAATGACTTAGACGATTGGCGCAAACTTTCCGCTCAAG 

AAAAAGATCTTGTTGGCAAGGTTTTTGGAGGCTTAACCCTACTTGATACCA 

TGCAATCAGAAACTGGTGTTGAAGCTATTCGTGCCGATGTTCGCACGCCTC 

ACGAAGAAGCTGTCTTAAACAATATTCAATTCATGGAATCTGTTCACGCTA 

AATCTTATTCTTCAATTTTCTCAACTTTAAATACTAAATCAGAAATTGAAG 

AAATTTTCGAGTGGACTAATAATAATGAGTTCCTTCAAGAAAAAGCACGT 

ATTATCAATGACATTTATGCTA ATGGAA ATGCCCT TCAAA AAAAGGTGGCT 

TCCACCTACCTCGAAACTTTCCTTTTTTATTCTGGC i 1 l l l CACACCTCTTTA 

CTATTTGGGAAATAATAAGTTAGCAAATGTTGCTGAAATCATTAAATTAAT 

TATTCGTGATGAATCTGTACATGGTACTTATATCGG TTAC AAATTCCAGCTT 

GGTTTTAACG AATTAC C AG AAG ATG AGC AAG AG AATTTTC GTG ATTG GAT 

GTATGACCTCCTTTATCAGCTGTATGAAAACGAAGAAAAA TACACC AAGA 

C ACTTTATG ATGG C GT AGG ATG G ACTG AAG AAGTT ATG ACC I 1 I 1 1ACGCT 

ACAATGCTAATAAAGCTCTTATGAATTTAGGACAAGATCCTTTATTCCCAG 

ATACAGCAAATGATGTCAACCCAATTGTTATGAATGGTATTTCAACAGGAA 

CATCAAACCATGACTTCTTCTCTCAAGTAGGTAATGGTTACCTACTTGGTA 

GCGTTG AAG CT ATG C ATG ATG ATG ACTAT AACT ATG G ATTAT AA 

MTTYYEAINWNEIEDVIDKSTWEKLTEQFWLDTRIPLSNDLDDWRKLSAQEK 

DLVGKVFGGLTLLDTMQSETGVEAIRADVRTPHEEAVLNNIQFMESVHAKSY 

SSIFSTLKTKSEIEEIFEWTNNNEFLQEKARIINDIYANGNALQKKVASTYLETF 

LFYSGFFTPLYYLGNNKLANVAEIIKLIIRDESVHGTYIGYKFQLGFNELPEDEQ 

ENFRDWMYDLLYQLYENEEKYTKTLYDGVGWTEEVMTFLRYNANKALMNL 

GQDPLFPDTANDVNPIVMNGISTGTSNHDFFSQVGNGYLLGSVEAMHDDDYN 

YGL* 



Sequence description: 

A] Length: 960 bp - 320 aa (full length gene) 

B] Shine Dalgamo sequence present upstream of 
ATG start codon, but no signal peptide 
present. 



ID-98 
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Clone RS-77 (partial sequence) 



ATGAATTGGTCACGTATCTGGGAACTCGTAAAAATTAATATCCTTTATTCA 

AACCCTCAGACTCTATCGGCACTAAGAAAAAAGCAAGAAAAGCATCCTAA 

AAAAGA ATTTT CAGCTTATAAATCCATGTTTAGAAATCAGTTATTTCAGAT 

TTTG CTCTTTTC AAT AATTT ATGT ATTTCTCTTTGT ATC A CTT G ATTTT AAA G 

AATATCCG GGCTA TTTCACGTTCTACATTGGTATCTTTACACTAGTATCCAT 

TATCTACTCTTTTATTGCGATGTACAGTGTTTTCTATGAGAGTGACGATGTT 

AA 

NWWSRIWELVKJNILYSNPQTLSALRKKQEKHPKKEFSAYKSMFRNQLFQILL 
FS1IYVFLFVSLDFKEYPGYFTFYIGIFTLVSIIYSFIAMYSVFYESDDV 



Sequence description: 

A] Length: 311 bp - 103 aa (Partial sequence) 

B] Shine Dalgarno sequence present upstream of 
ATG start codon, no obvious signal peptide at 
N-terminus. 

ID-99 



Clone RS-78 (partial sequence) 



TAATCTTTTAGTCAACGGAGCAACAGGAAAATTGCAGGCTATGCGACAGA 

TATTCCACCACATAATTTAGCAGAAGTCATTGATGCTGTCGTGTACATGAT 

TGA TCAC CCTAAAGCTAAATTAGATAAATTAATGGAATTTCTACCTGGTCC 

AGATTTTCCAACTGGCGCTATCATTCAAGGAAAAGATGAAATTCGTAAGG 

CATATGAGACTGGTAAGGGGAGAGTAGCGGTTCGCTCGCGAACTGCTATT 

GAAACCTTAAAAGGTGGTAAGAAACAAATTATTGTTACTGAAATTCCTTAT 

GAAGTTAAT 

SFSQRSNRJOAGYATD1PPHNLAEVIDAVVYMIDHPKAKLDKLMEFLPGPDFPT 
GAIIQGKDEIPJCAYETGKGRVAVRSRTAIETLKGGKKQI1VTEIPYEVN 



Sequence description: 



A] Length: 312 bp - 104 aa (Partial sequence) 

B] No obvious Shine Dalgarno sequence or a 



FIG. 1 CONTD 



t ,iiu n««i» 



1?! ".ill* 'Tti'.r 'l..tJ' 'lm? ttmii 



22/ 110 



signal peptide. Both N- and C- termini of ORF 
yet to be elucidated. 



ID-I00 



Clone RS-79 



ATGGGACGTAAGTGGGCCAATATTGTTGCCAAAAAGACTGCTAAAGATGG 

TGCTAACTCAAAAGTATACGCTAAATTCGGTGTTGAAATATATGTTGCTGC , 

AAAGCAAGGTGAACCAGACCCCGAGTCAAACTCAGCTCTAAAATTCGTTT 

TGGACCGTGCTAAGCAAGCACAAGTTCCAAAGCATGTTATTGATAAAGCG 

ATTGATAAAGCCAAAGGAAACACAGATGAAACTTTCGTAGAGGGACGCTA 

TGAAGGTTTTGGTCCAAATGGTTCAATGATTATTGTGGATACTTTGACATC 

AAATGTTAACCGTACGGCAGCAAATGTACGTACTGCTTACGGTAAGAACG 

GTGGCAATATGGGAGCTTCAGGATCGGTATCCTACTTATTTGATAAAAAAG 

GTGTCATCGTTTTTGCTGGTGATGATGCTGACACTGTCTTCGAACAATTACT 

TGAAGCGGATGTAGACGTAGATGATGTTGAAGCAGAAGAGGGAACAATA 

ACAGTTTATACCGCCCCAACAGATCTTCATAAAGGTATCCAAGCACTTCGC 

GATAATGGTGTAGAAGAATTCCAAGTTACTGAAC TTGAA ATGATTCCTCAA 

TC AG AAGTAGT ATTGG AAG GTG ATG AC CTTG AAACTTTTG A AAAG CTT 

MGRKWANIVAKKTAKDGANSKVYAKFGVEIYVAAKQGEPDPESNSALKFVL 
DRAKQAQVPKHVIDKAIDKAKGNTDETFVEGRYEGFGPNGSMIIVDTLTSNV 
NRTAANVRTAYGKNGGNMGASGSVSYLFDKKGVIVFAGDDADTVFEQLLEA 
DVDVDDVEAEEGT1TVYTAPTDLHKGIQALRDNGVEEFQVTELEMIPQSEVVL 

EGDDLETFEKL 



Sequence description: 

A] Length: 654 bp - 218 aa (Partial sequence) 

B] Possible Shine Dalgamo sequence upstream 
of ATG start, no obvious signal peptide 



ID-101 



Clone RS-80 
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TTGGAGAA ATATTT GAAGAACCCGATTACATGGATTGGATTAGTTCTTGTG 
GTT ACGTG GTTTTTAACTAAA AGTAGTG AATTTTTG ATTTTTG GTGTGTGTG 
TCTTGTTGTTAGTATTTGCTAGTCAAAGTGAT 

MEKYLKNPITWIGLVLVVTWFLTKSSEFLIFGVCVLLLVFASQSD 



Sequence description: 

A] Length: 135 bp - 45 aa (partial sequence) 

B] Shine Dalgamo sequence upstream of TTG 
start codon with possible signal peptide 
evident at N-terminus. 



ID- 102 
Clone RS-81 



ATGACACAATCAGATGCATATCTCTCGTTGAACGCGAAGACACGCTTTAGA 

GATCGCACAGGTAATTATCATTTTACTTCGGATAAAGAGGCTGTTGAACAA 

TATATGATAGAACATGTTGAACCTAATACGATGGTGTTCACATCACTAATT 

GAAAAGCTAGATTATTTGGTTTCTAATAACTACTATGAATCGGACCTTCTA 

AAACAATATA ACCTT GAGTTTATTTGCCAAATTTTTGAGCATGCATACGCT 

AAG AAATTTG CTTTTCT AAATTTTATGG G GG CTTT AAAATTTTATAATG CTT 

ATGCTCTTAAT 



MTQSDAYLSLNAKTRFRDRTGNYHFTSDKEAVEQYMIEHVEPNTMVFTSLIE 

KLDYLVSNNYYESDLLKQYNLEFICQIFEHAYAKKFAFLNFMGALKFYNAYA 
LN 



Sequence description: 

A] Length; 3 1 8 bp - 106 aa (Partial sequence) 

B] Shine Dalgamo sequence present upstream of 
ATG start codon, no obvious signal peptide 
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ID- 103 



Clone 2-1 1 A 



ATGGTATTTATGGCAAATAAGAAAAAAACAAAAGGAAAGAAAACCAGAA 

GACCTACTAAGGCAGAAATAGAGCGTCAAAGAGCTATTCAAAGGATGATT 

ACTGCTCTTGTTTTAACAATTATTCTCTTCTTTGGTATTATCAGATTAGGTA 

TTTTTGGTATTACAGTCTATAACGTCATCCGTTTTATGGTAGGTAGCTTGGC 

TTACTTATTTATTGCGG C AACTTTAATCT ACCTTTATTTCTTT AAATG GTTG 

CGAAAGAAAGATAGCTTAGTAGCAGGl 1111 lGATAGCTTCTTTAGGATTA 

TTGATTGAGTGGCATGCTTACCTTTTCTCAATGCCTATTTTGAAAGATAAA 

GAAATTTTGCGTTCAACTGCTCGATTAATTGTGTCTGATTTAATGCAATTTA 

AAATCACTGTTTTTGCCGGTGGAGGTATGTTGGGTGCTTTGATTTACAAGC 

CAATTGCTTTTCTCTTTTCTAATATTGGTGCCTATATGATTGGTGTTCTCTTC 

ATCATTTTGGGTCTCTTTTTAATGAGTTCTCTGGAAGTTTATGACATCGTCG 

AATTTATTAGAGCTTTTAAAAATAAAGTGGCAGAGAAGCACGAGCAAAAT 

AAAAAGGAGCGTTTTGCTAAGCGAGAGATGAAAAAAGCAATCGCTGAACA 

AGAGCGCATAGAGCGTCAAAAAGCTGAAGAAGAAGCTTATTTAGCTTCGG 

TTAATGTAGACCCTGAAACGGGTGAGATTCTAGAGGATCAAGCTGAGGAC 

AATTTGGATGATGCGCTACCACCTGAGGTAAGTGAAACATCAACTCCGGT 

ATTTGAGCCAGAGATCCTTGCTTATGAGACATCGCCTCAAAATGATCCTTT 

ACCAGTAGAGCCGACAATTTATTTAGAAGACTATGATTCGCCGATTCCTAA 

TATGAGAGAAAATGATGAGGAAATGGnTATGATTTAGATGATGATGTAG 

ATGATAGTGATATAGAAAATGTCGACTTTACACCTAAAACGACACTGGTTT 

ATAAATTACCAACGATAGATTTATTTGCACCAGATAAGCCTAAAAATCAAT 

CCAAAGAAAAGGATTTAGTCCGAAAGAATATCAGAGTTTTAGAAGAAACA 

TTTAGAAGTTTTGGTATCGATGTAAAAGTAGAACGTGCTGAAATTGGACCA 

TCAGTTACTAAATATGAAATTAAACCAGCAGTTGGAGTTCGTGTGAATCGT 

ATTTC AAATCTATCTG ACG ACCT AG CTCTTG CTCTTG C AG C A A AAG ATGTG 

CGTATAGAAGCACCAATTCCTGGAAAATCATTAATAGGTATTGAAGTTCCT 

AACTCAGAAATTGCAACGGTTTCTTTCCGCGAACTTTGGGAACAATCTGAT 

GCCAATCCTGAAAACCTTTTAGAAGTACCACTAGGAA AAGC TGTTAACGG 

CAATGCTCGCAGTTTTAACTTAGCTAGAATGCCGCATCTTTTGGTAGCTGG 

TTCAACTGGTTCAGGTAAATCTGTGGCAGTTAATGGAATTATTTCAAGTAT 

TTTGATGAAGGCACGTCCAGATCAAGTTAAGTTTATGATGATTGATCCCAA 

AATGGTTGAATTATCTGTTTATAATGATATTCCACATTTATTAATCCCTGTT 

GTAACCAATCCGCGTAAAGCAAGTAAGGCACTCCAAAAAGTTGTTGATGA 

AATGGAAAATCGATACGAGTTATTTAGCAAAATTGGTGTGCGTAATATAG 

CAGGTTATAATACAAAGGTTGAAGAGTTTAATGCTTCCTCTGAGCAAAAAC 

AAATGCCTTTGCCTTTAATCGTTGTCATTGTAGATGAATTGGCTGACTTGAT 

GATGGTTGCTAGTAAAGAAGTTGAAGATGCTATTATTCGTTTGGGGCAAAA 

AGCACGTGCTGCAGGTATCCATATGATTCTTGCAACTCAACGTCCATCCGT 



FIG. IcONTD 



25/110 



AGATGTTATTTCTGGTTTGATTAAAGCAAATGTTCCGTCGCGTATTGCATTT 

GCTGTTTCAAGTGGTACTGATAGCCGTACGATCCTTGATGAAAATGGTGCT 

GAAAAGCTCTTGGGACGGGGTGACATGCTCTTTAAGCCTATTGATGAGAAT 

C ATCC AGT ACG ACT AC AAG GTTCCTTTATTTC AG ATG ATG ATGTTG AAAG G 

ATCGTTGGTTTTATCAAAGACCAAGCCGAGGCTGACTATGATGATGCCTTT 

GATCCTGGAGAAGTATCTGAAACAGATAACGGCTCTGGTGGTGGCGGCGG 

AGTACCTGAAAGTGATCCTCTTTTTGAAGAAGCCAAGGGACTCGTTTTAGA 

GACGCAAAAAGCAAGTGCCTCAATGATTCAACGCCGATTGTCTGTTGGTTT 

CAATAGAGCAACAAGACTAATGGAAGAATTAGAAGCAGCGGGGGTTATTG 

GTCCAGCAGAAGGAACCAAGCCACGAAAAGTTTTAATGACTCCAACTCCG 

AGTGAATAA 

N^FMANKKKTKGKXTRRPTKAEIERQRAIQRMITALVLTIILFFGIIRLGIFGIT 

VYNVIRFMVGSLAYLF1AATLIYLYFFKWLRKKDSLVAGFLIASLGLLIEWHA 

YLFSMPILKDKEILRSTAJU-IVSDLMQFKJTVFAGGGMLGALIYKPIAFLFSNIG 

AYMIGVLFI1LGLFLMSSLEVYDIVEFIRAFKNKVAEKHEQNKKERFAKREMK 

KAI AEQERIERQ KAEEEA YLAS VNVDPETGE ILEDQ AEDN LDD ALPP E VSETST 

PVFEPEILAYETSPQNDPLPVEPTTYLEDYDSPIPNMRENDEEMVYDLDDDVDD 

SDIENVDFITKTTLVYKJ.PTIDLFAPDKPKNQSKEKDLVRKNIRVLEETFRSFGI 

DVKVERAEIGPSVTKYEIKPAVGVRVNRISNLSDDLALALAAKDVRIEAP1PGK 

SLIGIEVPNSEIATVSFRELWEQSDANPENLLEVPLGKAVNGNARSFNLARMPH 

LLVAGSTGSGKSVAVNGIISSILMKARPDQVKFMMIDPKMVELSVYNDIPHLLI 

PVVTNPRKASKALQKWDEMENRYELFSKIGVRNIAGYNTKVEEFNASSEQK 

QMPLPLIVVIVDELADLMMVASKEVEDAIIRLGQKARAAGIHM3LATQRPSVD 

VISGLIKANVPSR1AFAVSSGTDSRTILDENGAEKLLGRGDMLFKPIDENHPVRL 

QGSFISDDDVERIVGFIKDQAEADYDDAFDPGEVSETDNGSGGGGGVPESDPL 

FEEAKGLVLETQKASASMIQRRLSVGFNRATRLMEELEAAGVIGPAEGTKPRK 

VLMTPTPSE* 



Sequence description: 

A] Length: 2451 bp - 817 aa (Full-length gene) 

B] Shine Dalgamo sequence present upstream of 
ATG start codon, possesses a potential signal 
peptide 



ID-104 



Clone 2-18/22b 
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ATGTCAC AAGAG CAAGGAAAAATTTATATTGTAGAAGATGATATGACGAT 

TGTGT CACTT TTAAAAGATCATTTATCAGCTAGCTATCATGTCTCTAGTGTC 

AGCAATTTTCGTGATGTGAAACAAGAAATTATCGCATTTCAACCCGATTTG 

ATA CTAATG GAT ATT A C GTT A C CCTATTTT AATG GTTTTTACTG G ACTGC AG 

AATTGCGTAAGTTTTTAACAATTCCTATTATTTTCATTTCATCTAGTAATGA 

TGAAATGGATATGGTTATGGCATTAAATATGGGGGGTGATGACTTTATTTC 

AAAACCATTCTCTCTAGCTGTATTAGATGCTAAGCTAACTGCTATTTTAAG 

GAGAAGTCAACAATTTATCCAACAGGAATTAACTTTTGGGGGATTTACGTT 

GACAAGAGAAGGGTTATTGTCTAGCCAAGATAAAGAGGTTATTTTATCGC 

CAACAGAAAATAAAATCCTATCTATCTTGCTCATGCATCCTAAACAAGTAG 

TCTC AAAAG AGTCTCTATTAG AG A AACTTTGGG AAAATG ATAG 111 "1 ATTG 

ATCAAAATACACTTAATGTTAATATGACACGCTTACGTAAAAAAATTGTCC 

CAATAGGTTTTGATTACATTCATACAGTGAGAGGAGTTGGGTATTTACTAC 

AATGA 



MSQEQGKJYIVEDDMTIVSLLKDHLSASYHVSSVSNFRDVKQEIIAFQPDLILM 

DITLPYFNGFYWTAELRKFLTIP1IFISSSNDEMDMVMALNMGGDDFISKPFSLA 

VLDAKLTAILRRSQQFIQQELTFGGFTLTREGLLSSQDKEVILSPTENKILSILLM 

HPKQVVSKJESLLEKLWENDSFIDQNTLm^MTRLRKKIVPIGFDYlHTVRGVG 

YLLQ* 



Sequence description: 

A] Length: 669 bp - 223 aa (full-length gene 
sequence) 

B] Shine Dalgarno sequence present upstream of a GTG start codon. 
Was not identified directly by LEEP. This gene was found upstream of 
gene ID- 10 described in WO 00/06736. 

ID- 105 
Clone 2-20 



ATGTATCAAACTCAGACAAATAAGGAAAAATTTGTTTTATTTTTGAAATTA 

TTTATCCCAGTATTGATTTATCAATTTGCTAATtTTTCAGCTACTTTTATTGA 

TTCGGTTATGACTGGACAGTATAGTCAGCTACATTTGGCAGGTGTGTCAAC 

TG CT AGT AATTTATGGACTCCGTTTTTC GCTTTATTAGTAG GTATG ATTTC A 

GCATTAGTACCAGTAGTTGGTCAACATTTGGGTAGAGGAAATAAAGAACA 

AATTCGCACAGAATTTCATCAATTTCTATATTTAGGTTTGATACTGTCCTTA 

ATATTATTTTTAATCATGCAATTTATTGCTCAACCTGTCTTGGGGAGTTTGG 
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GTTTAGAAGATGAAGTTCTAGCAGTTGGTCGTGGTTATTTAAATTATATGT 

TGATTGGAATCATGCCGCTGGTGTTGTTTAGCATTTGCCGTTCATTCTTTGA 

TGCATTGGGGTTAACAAGGTTATCTATGTATCTGATGCTTTTAATTCTACCC 

TTT AATTC A 1 1111 1 AATTATATGCTTATCTACGGTAAATTTGGTATGCCTA 

GACTAGGAGGTGCGGGGGCAGGTCTTGGAACTTCTTTAACTTATTGGGCTA 

TTTTTATTGGTATTATTATTGTGATGTCACTTCATCCTCAAATTAAAACATA 

TCATATATGGACTCTGGAAAGAATAAAAGCTCCTTTGATTATTGAAGATAT 

TCGATTGGGATTACCGATTGGTTTACAAATTTTTGCAGAAGTTGCAATTTTT 

GCAGTAGTAGGCTTATTCATGGCAAAATTTTCTTCAATCATTATTGCAGCA 

CATCAGGCTGCTATGAATTTTTCATCATTAATGTATGCATTTCCTTTAAGTA 

TTTCCACTGCTCTAGCTATTACAATATCGTTTGAAGTAGGGGCAGAGCGCT 

TTCAGGACGCAACCACTTATAGTAGGATAGGACGCTTAACAGCGGTAGGG 

ATTACATCAGGAACCTTACTATTTTTATTTCTATTTCGTGAGAATGTAGCAG 

CAATGTATAATAGTGCCCCTCACTTTGTCGCTATTACAGCTCAATTCCTAAC 

TTATAGTCTC 1 111 1CCAGTTTGCAGATGCTTATGCAGCTCCTGTACAGGGG 

ATTTTACGAGGCTATAAGGATACAACAAAACCATTTATGATCGGTGCGGG 

CTCTTATTGGTTATGTGCTTTGCCATTAGCGGTTATCTTAGAAAAAAATAG 

CCAGTTAGGTCCGTTTGCCTATTGGATTGGTTTAATCACAGGTATTTTTGTT 

TGTGGTCTATTTCTAAACCAACGTCTGCAAAAGATTAAGAAGTTGTATTAT 

TAA 



MYQTQTNKEKFVLFLKLFIPVLIYQFANFSATFIDSVMTGQYSQLHLAGVSTAS 

NLWTPFFALLVGMISALVPVVGQHLGRGNKEQIRTEFHQFLYLGLILSLILFLI 

MQFIAQPVLGSLGLEDEVLAVGRGYLNYMLIGIMPLVLFSICRSFFDALGLTRL 

SMYLMLLILPFNSFFNYMLIYGKFGMPRLGGAGAGLGTSLTYWAIFIGIirVMS 

LHPQIKTYHIWTLER1KAPLIIED1RLGLPIGLQIFAEVAIFAWGLFMAKFSSIIIA 

AHQAAMNFSSLMYAFPLSISTALAITISFEVGAERFQDATTYSRJGRLTAVGITS 

GTLLFLFLFRENVAAMYNSAPHFVAITAQFLTYSLFFQFADAYAAPVQGILRG 

YKDTTKPFMIGAGSYWLCALPLAVILEKNSQLGPFAYWIGLITGIFVCGLFLNQ 

RLQKIKXL Y Y * 



Sequence description: 

A] Length: 1341 bp - 447 aa (full length gene) 

B] Shine-Dalgamo sequence present upstream of 
ATG start codon, There is a potential signal 
peptide sequence 



ID- 106 
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Clone 2-4A 



TTG CT AGTTT CTTCTCTAGTTTCTTGTTC ATTTTTTCTTGT C ATTTC GTC GTT 

GTCTTCATCAACACGAAATAAGTCTATAAACTTATCAAATAATTTCATAGA 

CTTATTATATCAA'i'l l'l CAATAAAATGCTATAATAAAACCATGTCA 1 11 1CA 

TTAAAAATTAGAAATCCATACGGTGAACATACCGTTAAAGAACTCCTTGA 

AGATTA1 11 m G ATTCCACGTAAGATTAGAC A 1111 1 1 GCGTGTTAAAAAA 

CATGTACTTATAAACAATGAATTCATTAATTGGCAAACTGTCGTCCAAGAA 

AACGATACTATTACCTTAATCTTTGATGATGAGGATTACCCTACTAAAAAA 

ATTCCTCTGGGCAGAGCAGAGCTTATTGATTGTCTTTATGAGGATGAACAT 

CTTATTATCGTTAATAAACCTGAAGGTATGAAAACTCACGGTAACCAACCA 

AATGAAATAGCACTGTTAAATCATGTATCTGCCTATTCTGGACAAACATGC 

TATGTTGTTCATCGCCTAGATATGGAGACCAGTGGAGCTGTTTTATTTGCT 

AAAAATCCATTTATACTTCCCCTTATCAATCAACGCTTAGAACGAAAAGAA 

ATTTGGCGTGAATATTGGGCTTTAGTTGAAGGAAAATTTTCACCTAAGCAT 

CAAGTTTTGAGAGACAAAATTGGACGGAACCGTCATGACAGACGTAAACG 

AATCATTGATTCTAAAAACGGTCAACATGCTATGACAATCATTGACGTTTT 

GAAGTATATCCAAAATAGTAGTCTCATAAAATGCCGACTGGAAACCGGAA 

GAACCCATCAAATTCGCATTCACTTATCTCATCACGGACATCCTTTAATAG 

GAGATCCCCTCTACAACCCTTCTTCTAATAATGAAAGGTTAATGCTACACG 

CTCACCGATTGACTCTATCCCATCCATTAACTTGCGAAACTATTAGCGTAG 

AGGCCCCTTCATCTACTTTCGAGAAGGTTTTAAACAATTATAAAAAAGGAG 

TTGGATAA 

MLVSSLVSCSFFLVISSLSSSTRNKSINLSNNFIDLLYQFSIKCYNKTMSFSLKIR 

NPYGEHTVK^LLEDYFLlPRKJIWFLRVKKHVLINNEFrNWQTVVQENDTlTLIF 

DDEDYPTKKIPLGRAELIDCLYEDEHLIIVNKPEGMKTHGNQPNEIALLNHVSA 

YSGQTCYVVHRLDMETSGAVLFAKNPFILPLINQRLEPvKEIWREYWALVEGKF 

SPKJiQVLRDKIGRNRHDPJUOUIDSKNGQHAMTIIDVLKYlQNSSLIKCRLETG 

RTHQ1RIHLSHHGHPLIGDPLYNPSSNNERLMLHAHRLTLSHPLTCET1SVEAPS 

STFEKVLNN YKKG VG • 



Sequence description: 

A] Length: 1029 bp - 343 aa (Full length gene sequence) 

B] No obvious Shine-Dalgarno sequence upstream 
of the putative TTG start codon. Possesses a 
potential leader peptide sequence. 
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ID- 107 



Clone 2-54 



GAACTAAATGCAACTCAACCTAATAATAGAACTACCTATATTATACCCGAA 

AGCAGTCATTCCATTGCAGAACAACAGAGATTCCTGATAGAATCAAAGGG 

TTCTTCGGTTGCATTACTTAATAGCGATGAATTTAGAAAGACAGCGGGAGA 

GGATAGAGG 1 TTI GAAAGGGATAAGTTGAGGTCTTTGGATATCATTCCTAA 

GGGAGATTTATCGACAAGTAATGTCATAGGTAATACGGACATTGCTAGTC 

AGATATCGTTGGGCTTTAAAAAGAATGCGATGCAGGAACACCATCTTACT 

AAAACATTCTCTCAAAAGGATGGAAAGTTATCGTCTGTTATAGAGGGGAT 

GCTTGCTATTGGCAAAGAGAAAGTAGAGAAAGAAATAAAATATAGTGGTA 

ATTTATGGCAAAAATTAAAAGCTAAGGCACACTGCCTTGTTTGCTGTGTTG 

ATAATTTGAATTTTGAAGATATAAAATCTTATTTTCAATATTATTGTCATCT 

AAA C CATC A GCTC AAATTACCTAAAG GTG CT ATACTTTCTGCT AAAAC A G A 

AGTATATAGGGGAGGAGATTTTGGGAGAAAAAATAAAGATAATGTGTTTG 

GTTACCGTATCCCCTCATTATTGAAAACCCAAAAAGGAACTTTACTTGCGG 

GAGCTGATGAAAGAATTGAGCAAGCTTGTGATTGGGGAAACATAGGAATG 

GTTATTCGCCGTAGTGAGGATGATGGTGTCACTTGGGGAAAAAGAGAAAC 

TATTGTCAATCTCCGTAATAACCCTAGAGTTCCGCTAGTTACTAGTGGTGA 

CTATAGTGGCTCACCTATTAATATGGATATGGCATTAGTTCAAGATACTAG 

CTCCAAGACGAAACGTA1 1 11 11 CAATATATGATATGTTTCCAGAAGGAAG 

AGGCGTTATTAGTATTGCTAACACACCTGAAAAAGAATATACCCAAATCG 

GAGGACAGTCTTATCTTAATTTATATAATAATGGAAAGAAATCGAAGGTTT 

TTACTATCCGTGACAAAGGTATTGTATATAATTTTAAAGGGAAAAAGACTG 

ATTATCATGTTATAACAGAAACTACTAAAAGTGACCATTCAAATCTAGGGG 

ATATTTATA A G G GAAAAC AG CT ACTTG G A AAT ATATATTTTAC AAAAC ATA 

AAACGTCACCATTTCGTTTAGCAAAATCAAGCTATGTGTGGATGTCATATA 

GCGATGATGATGGTAGGACATGGTCATCACCTAGAGATATAACAGCAAGT 

CTTCGTCAGAAAGGCATGAAATTTTTGGGAATAGGACCTGGAAAAGGTAT 

AGTTTTAAAATGGGGGCCACACGCTGGTCGTATTATTATTCCTGCCTATTCT 

ACGAATTGGAAATCTCATCTAAGAGGTrCACAATCTTCACGCCTAATTTAT 

TCAGACGACCATGGAAAAACGTGGCATACTGGAAAAGCAGTTAATGATAA 

CCGTATACTTTCTAATGGTGAAAAAATTCACTCCTTAACAATGGATAATAA 

AAAAGAACAAAATACAGAATCCGTACCCGTTCAATTGAAAAATGGGGACA 

TTAAGTTATTTATGAGGAATCTAACTGGTAACCTAGAAGTAGCCACAAGTA 

AAGACGGCGGGGAGACTTGGCAAAACCATGT TAAA CGATATAAGGAAATT 

CATGATGCTTACGTCCAACTATCAGCTATTCGCTTTGAGCATGACAAAAAA 

GAGTATATTTTATTAGTGAATGCTAATGGGCCAGGGAAGAAGTGCCAAGA 

TGGATATGCACGTCTAGCGCAAGTTAATCGAAATGGTAGTTTTAAGTGGTT 

ATATCACCATCACATTCAAGATGGTTCGTTTGCTTACAACTCTGTTCAACA 

ACTTAATAATGATCAATTTGGTGTCCTTTATGAACATAGAGAAAAACATCA 
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AAATAGTTTT ACTTT AAATT AC AAAGTTTTT AATTG G AGTTTTCTT AGTC AA 
AATACAGAGAAGCAAGGCACTTTATGGGAGAAAATGGCAGCAAATTGGCA 
TGTTTTGTTTAAATTTTATTTATGA 



ELNATQPNNRTTYIIPESSHSIAEQQRFLffiSKGSSVALLNSDEFRKTAGEDRGF 

ERDKLRSLDIIPKGDLSTSNVIGNTDIASQISLGFKKNAMQEHHLTKTFSQKDG 

KLSSVIEGMLAIGKEKVEKEIKYSGNLWQKLKAKAHCLVCCVDNLNFEDIKS 

YFQYYCHLNHQLKLPKGAILSAKTEVYRGGDFGRKNKDNVFGYRIPSLLKTQ 

KGTLLAGADER1EQACDWGNIGMVIRRSEDDGVTWGKRETIVNLRNNPRVPL 

VTSGDYSGSPINMDMALVQDTSSKTKRIFSIYDMFPEGRGVISIANTPEKEYTQI 

GGQSYLNLYNNGKKSKVFTIRDKGIVYNFKGKKTDYHVITETTKSDHSNLGDI 

YKGKQLLGNIYFTKHKTSPFRLAKSSYVWMSYSDDDGRTWSSPRDITASLRQ 

KGMKFLGIGPGKGIVLKWGPHAGRIIIPAYSTNWKSHLRGSQSSRLIYSDDHG 

KTWHTGKAVNDNRILSNGEKJHSLTMDNKKEQhTTESVPVQLKNGDIKLFMRN 

LTGNLEVATSKDGGETWQNHVKRYKEIHDAYVQLSAIRFEHDKKEYILLVNA 

NGPGKKCQDGYARLAQVNRNGSFKWLYHHHIQDGSFAYNSVQQLNNDQFG 

VLYEHREKHQNSFTLNYKVFNWSFLSQNTEKQGTLWEKMAANWHVLFKFYL 



Sequence description: 

A] Length: 2052 bp - 684 aa (partial gene sequence) 

B] N-terminus has yet to be determined 



ID- 108 



Clone 2-61 



ATGCCT AAATT AATCGTATCTTTCCTCTGCATTTT ATT ATCCCTGACTTGTG 
TAAACTCTGTG C AAG CTG A AG AAC AT AAAG ATATTATG C AAATTACC C G A 
GAAGCCGGATATGATGTTAAAGATATTAATAAACCTAAAGCGTCTATCGTT 
ATTGACAATAAAGGTCATATTTTGTGGGAAGATAACGCCGATTTAGAACGT 
GATCCCGCTAGCATGTCTAAAATGTrTACTTTATATTTACTATTTGAAGACT 
TAGCTAAAGGAAAAACAAACCTCAACACCACAGTGACTGCAACAGAAACA 
GACCAAGCCATAAGTAAGATTTATGAAATTAGTAATAACAATATTCATGCT 
GG GGTTG CTTATC CTATTC GTG AACTG ATTACT ATG ACGGCTGTCCCGTC A 
TCTAATGTAGCAACTATTATGATTGCTAACCACTTATCACAAAACAATCCT 
G AC G C CTTT ATTAAACG AATC AATG AA ACC G CC AAG AAACT CG GTA TG AC 
AAAAACTCACirTTATAACCCCAGTGGGGCGGTAGCGAGTGCTTTTAATGG 
ACTTTACTC C CC AAAA G AATAC G ATA AC AATGCTACT AACGTT A C G ACTG C 
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ACGTGATCTATCAATTTTAACCTATCATTTCCTTAAAAAATACCCTGATATA 

CTGAACTATACAAAATATCCTGAAGTCAAGGCCATGGTCGGAACTCCTTAT 

GAAGAAACATTTACAACTTATAACTACTCTACCCCCGGCGCTAAATT TGGA 

TTAG AAGG AGTAG ATGGCTTA AAAACTG GTTCTA G C CCTAGCGCTGCTTTT 

AATGCCTTAGTTACAGCTAAACGCCAGAATACTCGCTTGATAACTGTGGTT 

TTAGGAGTTGGCGATTGGTCAGACCAAGACGGAGAGTACTATCGTCATCC 

GTTTGTCAACGCTCTTGTAGAAAAAGGTTTTAAAGACGCTAAAAATATTTC 

TTCTAAAACTCCTGTATTAAAAGCCGTTAAACCTAAAAAAGAAGTTACTAA 

AACCAAAACTAAATCTATTCAAGAACAGCCTCAAACAAAAG AACA GTGGT 

GGACAAAAACAGATCAATTTATCCAATCACATTTTGTATCTATTTTAATTG 

TTCTGGGCACCATCGCTAGCCTTTGTCTTTTAGCTGGGATAGTATTACTTAT 

AAAGCGCTCTAGATAA 

MPKLIVSFLCILLSLTCVNSVQAEEHKDIMQITREAGYDVKDINKPKASIVIDN 

KGHILWEDNADLERDPASMSKMFTLYLLFEDLAKGKTNLNTTVTATETDQAI 

SKJYEISNNN1HAGVAYPIRELITMTAVPSSNVATIMIANHLSQNNPDAFIKRINE 

TAKKLGMTKTHFYNPSGAVASAFNGLYSPKEYDNNATNVTTARDLSILTYHF 

LKXYPDILNYTKYPEVKAMVGTPYEETFTTYNYSTPGAKFGLEGVDGLKTGS 

SPSAAFNALVTAKRQNTRLITWLGVGDWSDQDGEYYRHPFVNALVEKGFK 

DAKNISSKTPVLKAVKPKKLEVTKTKTKSIQEQPQTKEQWWTXTDQFIQSHFVS 

ILIVLGTIASLCLLAGIVLLIKRSR* 



Sequence description: 

A] Length: 11 88 bp - 396 aa (full length gene) 

B] Shine Dalgamo sequence present upstream of 
ATG start codon, possesses a potential signal 
peptide 



ID- 109 



Clone 45 



ATGACTGAAAAATATTATAATTGGGCAACGCTTGGAACCGGCGTTATTGCC 

AACGAATTAGCCCAAGCACTGGAAGCACGTGGACAAAAATTATATTCTGT 

AGCTAATAGAACTTACGACAAAGGACTTGAATTTGCTAACAAATATGGTA 

TCCAAAAAGTTTATGATCACATAGATCAAGTATTTGAAGA CCCTG AAGTGG 

ATATCATTTATATCTCTACTCCCCACAATACTCACATCTCATTTTTACGAAA 
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GGCTTTAGCAAATGGTAAGCACGTTCTTTGCGAAAAATCTATTACTTTAAA 

TAGTACTGAGCTTAAAGAAGCCATAGATTTAGCCGAAACTAACCATGTTGT 

CTTAGCTGAAGCCATGACTATTTTTCATATGCCAATTTACCGCCAATTAAA 

AACATTAGTTGATAGTGGAAAATTAGGACCGTTAAAAATGATTCAAATGA 

ATTTCGGAAGTTATAAAGAATATGATATGACTAACCG I 1 1 1 1 1 CAGTCGTG 

ACCTAGCAGGCGGTGCTTTGCTGGACATTGGTGTTTATGCACTTTCTTGTAT 

TCGCTGGTTTATGTCAGAAGCACCTCACAACATTACCTCTCAAGTTACATT 

TGCACCAACAGGGGTTGATGAACAAGTTGGTATCCTACTAACCAACCCAG 

CAAATGAGATGGCGACTGTCAGCCTTAGTTTACATGCAAAACAACCTAAA 

CGAGCAACTATCGCTTACGATAAAGGCTACATTGAAC 11111 GAATATCCG 

CGAGGACAAAAGGCAGTTATTACTTATACTGAGGATGGGCATCAAGATAT 

TATCGAAGCTGGCAAAACTGAAAATGCTCTCCAATATGAGGTAGCTGATA 

TGGAAGAAGCCATTTCAGGAAAAACTAACCACATGTACTTAAACTATACC 

AAAGATGTTATGGATATCATGACACAGCTACGTCAAGAATGGGGATTTAC 

CTACCCAGAAGAAGAAAAATGA 

MTEKYYNWATLGTGVIANELAQALEARGQKLYSVANRTYDKGLEFANKYGI 

QKVYDHlDQVFEDPEVDIIYISTPHhTTHISFLRKALANGKHVLCEKSlTLNSTEL 

KEAIDLAETNHWLAEAMTIFHMPIYRQLKTLVDSGKLGPLKM1QMNFGSYK 

EYDMTNRFFSRDLAGGALLDIGVYALSCIRWFMSEAPHNITSQVTFAPTGVDE 

QVGILLTNPANEMATVSLSLHAKQPKRATIAYDKGYIELFEYPRGQKAVITYT 

EDGHQDIIEAGKTENALQYEVADMEEAISGKTNHMYLNYTKDVMDIMTQLR 

QEWGFTYPEEEK* 



Sequence description: 

A] Length: 984 bp - 328 aa (full length gene) 

B] Shine Dalgamo sequence present upstream of 
ATG start codon, possesses a potential signal 
peptide 



ID-110 



Clone 2-2 



GTGTATTCTCCTGTTAAATCTTCTAAAGGAAAAGTGATATTGTTAAAAAGT 
GATTTTCTAAAGAGCTTCATAGAAAGGAGAGGAAATATTTGTTTT 

MYSPVKSSKGKVILLKSDFLKSFIERRGNICF 



FIG. 1 CONT'D 
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Sequence description: 

A] Length: 96 bp - 32 aa (partial sequence) 

B] GTG start codon - no obvious Shine-Dalgamo 
sequence 

Possesses a potential signal peptide 



ID-Ill 



Clone 2-3 



AAATACTGTATCATTGCAACCTCAAATGCAGGTTTTGGAAACGAAGCATTT 

ACAGGTGACAGCGATAAAGACTTGAAAATTATGGAACGAATTTCTCCATA 

TTTCCGTCCAGAATTTCTAAATCGTTTCAATGGTGTTATTGAATTCTCTCAC 

CTAAGCAAAGATGACTTAAGCGAAATTGTAGATTTGATGCTTGATGAAGTT 

AACCAAACAATTGGCAAAAAAGGAATTGACCTTGTGGTAGATGAAAATGT 

TAAATCACACTTAATTGAACTGGGTTATGACGAAGCAATGGGAGTACGTC 

CATTGCGCCGTGTCATCGAGCAAGAAATTCGAGATCGCATCACAGACTACT 

ATCTCGATCATACAGACGTTAAACACCTAAAAGCTAATTTGCAAGATGGCC 

AAATCGTCATTTCTGAAAGATAA 

KYCIIATSNAGFGNEAFTGDSDKDLK1MEPJSPYFRPEFLNRFNGVIEFSHLSKD 
DLSEIVDLMLDEVNQTIGKKGIDLVVDENVKSHLIELGYDEAMGVRPLRRVIE 
QEIRDRITDYYLDHTDVKHLKANLQDGQIVISER* 



Sequence description: 

A] Length: 429 bp - 143 aa (partial sequence) 

B] N-terminus yet to be elucidated. This gene 
was not in frame with nuc 



ID-112 
Clone 2-5 



FIG. 1 CONT'D 
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ATGTC AATG AATTTTTC ATTTTTACC A C AAT ATTG GTC CTATTTT AATTATG 

GTGTGATGGTAACCATTATGATTTCAACATGTGTTGi 1 1 1 I 1 ITGGAACTAT 

TATAGGCGTGTTAATTGCTTTAGTAAAGCGTACTAATTTACATTTTCTCACA 

ATATTAGCTAATTTCTATGTATGGGTATT TCGTG GGACACCGATGGTAGTT 

C AAATT ATG ATTG CTTTCGC ATG G ATG C ATTTT AAC AATTT A C C AAC AATT 

AGCTTTGGTGTTTTAGATTTAGATTTTACACGACl 1 1 I ACCTGGTATCATTA 

TCATTTCCTTAAATAGTGGTGCCTATATTTCGGAAATTGTACGTGCAGGGA 

TTGAGGCTGTACCATCTGGACAAATAGAAGCAGCTTACTCGTTGGGGATTC 

GACCTAAAAATACACTTCGCTATGTTATCTTACCCCAAGCTTTTAAAAATA 

TTTTACCTGCTCTAGGGAATGAATTTATTACAATTATTAAAGATAGTGCTCT 

CCTTCAAACTATTGGTGTCATGGAATTATGGAACGGAGCACAATCAGTTGT 

AACGGCTACTTACTCACCAGTTGCACCGTTATTATTTGCAGCATTTTACTAT 

TTAATGTTGACAACGATTCTCTCAGCTTTGTTAAAACAAATGGAGAAATAT 

CTTG G G AAAG GGGTAAAAATAG ATG GTTGA 

MSMNFSFLPQYWSYFNYGVMVTIMISTCVVFFGT1IGVLIALVKRTNLHFLTIL 

ANFYVWVFRGTPMVVQIMIAFAWMHFNNLPTISFGVLDLDFTRLLPGIIIISLNS 

GAYISEIVRAGIEAVPSGQIEAAYSLGIRPKNTLRYVILPQAFKNILPALGNEFITI 

IKDSALLQTIGVMELWNGAQSVVTATYSPVAPLLFAAFYYLMLTTILSALLKQ 

MEKYLGKGVKIDG* 



Sequence description: 

A] Length: 699 bp - 233 aa (full length gene) 

B] Shine-Dalgarno sequence preceded the 'ATG* 
start codon. Possesses a potential leader peptide 
sequence. 



ID-113 
Clone 2-7 



ATG AAAG AC CTATTACG AAAT AGTCTAG AG C AAAGTG G AAATTTAAGTTT 

TCAAGATATGATTTTACATATTCTTGTAGCAGCTTTATTGAGTGTAGTTATT 

TATGTTTCCTATGCTTATACGCATAGTGGAACTGCCTATAGTAAAAAGTTT 

AATGTTTCATTAATGACATTGACGGTCTTGACTGCAACAGTAATGACCGTT 

ATTGGTAATAATGTAGCCTTGTCATTGGGTATGGTCGGTGCCTTGTCAGTT 

GTTCGTTTTAGGACAGCCATAAAAGATTCAAGAGATACAGTTTATATTTTT 

TGGACCATAGTTGTTGGTATCTGTTGTGGTGTCGGTGACTATGTGGTAGCT 



FIG. 1 CONTD 
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GCATTAGGAAGTAGCGTTATCTTTATCTTATTATGGGTTATGGGACGTGTT 

AAAAACGAGAATCGTATGTTATTGATTGTGAAGTGCGATAGAACACTAGA 

AGTTGATTTAGAAGGAA 1111 CTTCCAATATTTTGACGGAAAAGCTGTTCA 

GCGTGTTAAAAATTCAACAACTAATACTATTGAAATGATTTTCGAAATCTC 

TAGAAAAGATTACGATAAGCAACTCCATGTAGATAATCAGTTAACTGAAA 

AAGTGTACCAATTGGGAAATATTGATTATTTCAACATTGTTAGCCAAAGCG 

ACGAAATCAATGGGTAG 

MKDLLRNSLEQSGNLSFQDMILHILVAALLSWIYVSYAYTHSGTAYSKKFNV 
SLMTLTVLTATVMTVIGWVALSLGMVGALSVVRFRTAIKDSRDTVYIFWTTV 
VGICCGVGDYVVAALGSSVIFILLWVMGRVKNENRMLLF/KCDRTLEVDLEGI 
FFQYFDGKAVQRVKNSTTmiEMIFEISRKDYDKQLPIVDNQLTEKVYQLGNID 
YFNTVSQSDEING* 



Sequence description: 



1D-114 
Clone 2-8 



A] Length: 678 bp - 226 aa (full-length gene) 

B] ATG start codon is preceded by a Shine- 
Dalgarno sequence-Possesses a potential leader 
peptide sequence 



AAAAATTCATTTTAGATTCA rri'lACG ACTATATACTCAGAAGTACCAAAC 

CTAATCCAAGGTTTGAAAAAAGAAAGAAGGAAGTCAGTATGACAAACTAT 

AAAAACAACTTTAAAGATGAGGCTATACGTGTTGAAGAGACAACAAAAGA 

ATCATTTTACGATGTTGATATTGCCTTGTTTTCAGCTGGTGGATCTATTTCA 

GCAAAGTTCGCTCCTTATGCAGTAAAGTCTGGAGCAGTTGTAGTAGATAAC 

ACGTCATATTTTCGTCAGAATCCTGATGTTCCACTAGTTGTTCCTGAAGTAA 

ATG CTC ATG CC ATG ATTGGTC AT AATGGT ATC ATAGCTTGTC C C AATTGTTC 

TACTATTCAAATGATGATTGCTTTAGAGCCCATTCGTCAAAAATGGGGGAT 

AGAGCGTGTTATAGTTTCCACCTATCAAGCTGTTTCGGGTTCAGGTGCACG 

TGCTGTTG AAGAAACTAAG G AAC AGTTGAG AC AAGTTTT 

KPILDSFYDYILRSTKPNPRFEKRKKEVSNfmYKNNFKDEAIRVEETTKESFYD 
VDIALFSAGGSISAKFAPYAVKSGAWVDNTSYFRQNPDVPLWPEVNAHAMI 
GHNGIIACPNCSTIQMMIALEPIRQKWGIERVIVSTYQAVSGSGARAVEETKEQ 
LRQV 



FIG. 1 CONTD 
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Sequence description: 

A] Length: 499 bp - 165 aa (partial sequence) 

B] N-terminus has yet to be determined 



ID-115 



Clone 2-9 



ATGACAAATGAATTGATAATGCAAGCTTTTGAGTGGTATTTACCTAGTGAT 

G G G AATC ACTG G AAG AAATTAG AG GAG TCTATATC AG ACCTTAAAAAACT 

TGGAATTAGTAAAATCTGGTTACCACCAGCATTTAAGGGAACTAGCAGTG 

ATGATGTAGGATATGGTGTTTATGATCTCTTTGATTTAGGAGAATTTGACC 

AGAATGGAACAATTAGAACAAAATATGGTAGGAAAGAAGAGTATCTAAA 

GCTTATTAAGTCGTTAAAGGCAAATGGCATTAAACCGTTTGCAGATATCGT 

TCTTAACCATAAAGCCAATGGTGATCATAAAGAAAAATTTCAAGTCATCA 

AAGTCAATCCTGAAAATCGTCAAGAAGCATTAAGTGAACCCTATGAGATT 

GAAGGATGGACGGGATTTGATTTCCCAGGTAGACAGGGTGAGTACAATGA 

TTTT 

MTNELIMQAFEWYLPSDGNHWKKLEESISDLKKLGISKIWLPPAFKGTSSDDV 
GYGVYDLFDLGEFDQNGTIRTKYGRKEEYLKLIKSLKANGIKPFADIVLNHKA 
NGDHKEKFQVIKVNPENRQEALSEPYEIEGWTGFDFPGRQGEYNDF 



Sequence description: 

A] Length: 456 bp - 152 aa (partial sequence) 

B] ATG start codon is preceded by a Shine- 
Dalgarno sequence, no leader peptide sequence. 



ID-116 



Clone 2-10 

FIG. 1 CONTD 
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ATGGAGGTTCTTATGAAGAAAGTGTTAGTAAGTAGTCTTTTGGTTTTAGGG 

ATTACGATAACGTTACAACCAGTAGTTGAGGCTAAGGGGCCAAAAGTAGC 

TTATACACAAGAGGGAATGACTGCTCTTTCGGACACAAATAAAGATAAAG 

TCACTACTATTTCTATTGACGAGATTCAAAAAAGCTTAGAAGGTAAGAAGC 

CGAT TACTGTTAGTTTTGATATTGATGATACACTGCTTTTCAGTAGTCAATA 

TTTTCAATATGGTAAAGAATATGTAACTCCTGGATCGTTTGATTTTCTTCAT 

AAACAAAAATTCTGGGATCTTGTTGCAAAACGAGGAGATCAAGATTCCAT 

TCCCAAAG AATATG CTAAAAAATTAATTGCTATGCATCAAAAACGAGGAG 

ATAAAATTG 1 1 1 11 ATAACAGGTAGGACAAGAGGGTCAATGTATAAGGAG 

GGCGAGGTTGATAAAACAGCTAAAGCCTTAGCTAAAGATTTTAAATTTGTA 
CCATCTGAT 

MEVLMKKVLVSSLLVLGITITLQPWEAKGPKVAYTQEGMTALSDTNKDKVT 
TISIDEIQKSLEGKKPITVSFDIDDTLLFSSQYFQYGKEYVTPGSFDFLHKQKFW 
DLVAKRGDQDSIPKEYAKKLIAMHQKRGDKIVFITGRTRGSMYKEGEVDKTA 
KALAKDFKFVPSD 



Sequence description: 

A] Length: 516 bp - 172 aa (partial sequence) 

B] ATG start codon is preceded by a Shine- 
Dalgarno sequence, Possesses a leader peptide 
sequence. 



ID-117 



Clone 2-17 



ATGCTTAAAAGATTATTTACTGAAGATGGGGAATTGACAAAGATTAGTCGT 

C GTTT CGTTTGGATGTTAGTGGTTATCTATTGTCTTATTATTGTCAGGATGT 

GTTTTG G G CCTC AAATTATGATTG AG G GG GTATC AA CTCCG AATGTTC AGC 

GCTTCGQAAGAATTGTAGCTCTTTTAGTACCATTTAATTCTTTTCGTAGTTT 

AGATCAGC TAAC TAGCTTTAAAGAGATTCTTTGGGTTATTGGTCAAAATGT 

AGTGAATATTTTACTGCTGTTTCCTCTCATTATAGGGTTACTATCCCTAAAG 

CCAA GTTrACGGAAATATAAAAGCGTTATATTACTTGCTTTCTTGATGTCTC 

TT TTCATA GAGTGTACTCAAGTTGTTTTAGATATTTTAATAGATGCTAATCG 

GGTTTTTGAAATCGACGATCTATGGACAAATACCTTAGGCGGTCCTTTCGC 

CCTATGGAGTTATCGAAACATAAAAGGTTGGCTTCTAACTATTAGAAAATG 
A 



FIG. 1 CONTD 
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MLKRLFTEDGELTKISRJU^VWMLVVIYCLIIVRMCFGPQIMIEGVSTPNVQRPG 
RTVALLVPFNSFRSLDQLTSFKEILWVIGQNVVNILLLFPLIIGLLSLKPSLRKYK 
SVILLAFLMSLFIECTQVVLDILIDANRVFEIDDLWTNTLGGPFALWSYRNIKG 
WLLTIRK* 



Sequence description: 

A] Length: 516 bp - 172 aa (full-length gene) 

B] ATG start codon is preceded by an Shine- 
Dalgamo sequence. Possesses a potential leader 
peptide sequence. C-terminus need further 
confirmation. 



ID-118 



Clone 3-3 



atg aaaaagcttacttttatttg gg atttag atg g g ac attaatag attcg 

tatgtaccaattatggaagctcttgaagaaacctatcgtcattttggctta 

atatttgataaagaattaatccatgaatatattttacaggaatcagtgggg 

caattattggtaaacctttcagaggaagagcaaatacctcatgaaaaact 

gaaagcatattttacaaaagaacaagaaagtcgagattctaaaatacatt 

taatg ccat atgcaaaagagattttagaatggaccaaagaacaagatatt 

cccaattttatgtatacacataaaggagcaagtacgcattcagtgttggaa 

accttgcagatctctcattattttgatgaaattttaactggtgtttcgggat 

tcgagcgaaaaccacatccacaagggattaattatttagttaaacgatatt 

ctttagataaatcaatgacttattacataggagatcgtccactagatttgg 

aggttgctcaaaatgctggtataaaatccataaacttaaggttagagaatt 

ccaaagaaaactataatatttcaagtctcaaagatataatatcacttgatt 

tcactcgtttggattaa 

mkkltfiwdldgtlidsyvpimealeetyrhfglifdkeliheyilqesvgqll 
vnlseeeqipheklkayftkeqesrdskihlmpyakeilewtkeqdipnfmyth 
kgasthsvletlqishyfdeiltgvsgferkphpqginylvkrysldksmtyyi 
gdrpldlevaqnagiksinlrlenskenynisslkdiisldftrld* 



Sequence description: 

A] Length: 627 bp - 209 aa (Possible Full-length gene) 

FIG. 1 CONTD 
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B] ATG start codon is preceded by an possible 
Shine-Dalgarno sequence. No obvious leader 
peptide sequence. 

ID-119 



Clone 3-7 



ATGGAAAAAGAAAAAAAATTAGGTCTTTTACCACTAACAATGCTTGTCATT 

GGCTCTCTTATCGGTGGCGGAATCTTTGATTTAATGCAAAATATGAGTTCC 

AGAGCCGGTTTGGTACCAATGCTTATTGCTTGGGTAATTACTGCTATCGGG 

ATGGGAACTTTCGTTTTAAGTTTTCAAAATTTATCTGAAAAAAGGCCGGAC 

CTAACAGCTGGAATCTTTAGTTACGCTAAAGAGGGGTTTGGAAACTTTATG 

GGATTTAACTCTGCATGGGGTTATTGGTTATCAGCTTGGCTTGGAAATGTT 

GCCTACGCTGCACTCTTATTCAGTTCACTCGGTTATTTCTTTAAATTCTTTG 

GT AATG G AAATAATATC ATCTC AATTATTG GAG C AAGT AT AGTTATTTG G G 

TTGTCCATTTCTTAATTTTAAGAGGTGTTAATACAGCTGCATTTATTAATAC 

CGTAGTTACCTTTGCAAAATTAGTACCTGTTATTA 1 ITTCI 1 AATTTCAGCG 

TTATTAGCTTTC AAATTTAAC A 1 11 1 1 AGTCTTGATATCTGGGGAAATGGAT 

TACATCAATCAATTTTCAACCAAGTCAATTCAACTATGAAAACCGCTGTTT 

GGGTATTTATTGGTATTGAGGGCGCCGTTGTCTTCTCAGGTCGTGCTAAAA 

AACACTCTGATATTGGTAAAGCAAGTATCCTAGCATTATTCACTATGATTT 

CACTTTATGTATTGATTTCTGTTTTATCACTTGGTATCATGTCACGTCCAGA 

ACTTGCAAACTTAAAAACACCAGCTATGGCTTACGTTCTAGAAAAAGCTGT 

TGGTCACTGGGGTGCTATCTTAGTTAACCTTGGTGTTATCATTTCAGTATTT 

GGCGCTATTCTTGC TTGGA CTTTATTTGCAGCAGAATTACCATATCAAGCT 

GCTAAAGAAGGTGCTTTTCCTAAA1 1 1 T1TGCAAAAGAAAATAAAAACAA 

AGCTC C AATC AACTC ACTCTTAGTC ACTAATCTTTGTGT AC AA G C ATTCTTA 

ATCACGTTCTTATTCACACAAAGTGCTTATCGTTTTGGTTTCGCATTAGCAT 

CATCTGCTATCTTAATTCCTTATGCTTTTACAGCACTATATCAATTACAATT 

CACACTCCGTGAGGATAAGTCAACTCCAGGACATCAAAAGAATTTAATTA 

TCGGTATCCTCGCTACAATCTATGCTGTTTACCTTATCTACGCTGGTGGTTT 

TGATTACTTACTTTTGACAATGATTGCTTATACTCTAGGTATGATTCTCTAT 

ATTAAAATGAGAAAAGATGACAAGCTTGGCGTAATCATGGTCATAGCTGT 

TTCCAGTGTGAAATTGTTATCC 



MEKEKKLGLLPLTMLVIGSLIGGGIFDLMQNMSSRAGLVPMLIAWVITAIGMG 
TFVLSFQNLSEKRPDLTAGIFSYAKEGFGNFMGFNSAWGYWLSAWLGNVAY 
AALLFSSLGYFFKFFGNGNNIISIIGASIVIWVHFLILRGVNTAAFIhTTVVTFAK 
LVPVIIFLISALLAFKFNIFSLDIWGNGLHQSIFNQVNSTMKTAVWVFIGIEGAV 

FIG. 1 CONTD 
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VFSGRAKKHSDIGKASILALFTMISLYVLISVLSLGIMSRPELANLKTPAMAYV 

LEKAVGHWGAILVNLGVIISVFGAILAWTLFAAELPYQAAKEGAFPKFFAKEN 

KNKAPINSLLVTNLCVQAFLITFLFTQSAYRFGFALASSAILIPYAFTALYQLQF 

TLREDKSTPGHQKNLIIGILATIYAVYL1YAGGFDYLLLTMIAYTLGMILYIKMR 

KDDKLGVIMVIAVSSVKLLS 



Sequence description: 

A] Length: 1356 bp - 452 aa (partial sequence) 

B] ATG start codon is preceded by an possible 
Shine-Dalgamo sequence. Possesses a potential 
leader peptide sequence. 



ID- 120 



Clone 3-8 



ATGAAATTTGAAAAACGGCAGGTCTATTATGTTGTCATAACATTTGCTATT 

TGCTATGCTATACAGGCTTATTGGGGAGCTGTTTCTAATATTTTAACTACGC 

TTCATAAGGCAATATTTCCl 1 1 1 11 GATGGGAGCTGGAATTGCCTATATTAT 

TAATATTGTAATGTCAGTCTATGAGCGATTATATATAAAGC I I 1 1 1 AAAGG 

ATCTAGACTATTAATGGCAATCAAGCGTAGTGTTTCTATGATTTTATCCTAT 

GC AACTTTT ATTGGTTT AATTGTCTG G CTATTTTC AATTGTC ATTCC AG ATT 

TG ATTTCT AG TTTG A GTTCTTT ATTG GTTATTG ATA C C G G A G C A CTTG CT AA 

ATTGGTTAATAATCTCAATGAAAATAAACAAATTTCTGAGGCTTTAAATTA 

TATGGGAACAGATAAAGACTTAGTTTCTACTTTAAGTGGTTATAGCCAGCA 

G ATTTTG AAGC AAGTTTTATCTG ' I 111 AACAAATTTACTAACCTCAGTTTCC 

TCTATTGCGGCAACACTTCTGAATG 11IT I GTTAGTTTTAIT 111 1 CAATTTA 

CGTTTTGGCAAACAAGGAGCAGTTGGGACGTCAATTTAATTTGTTAATTGA 

TACCTATTTAGGTTCAACAGGCAAAACATTCCATTACGTTCGTCATATCCTT 

CATCAACGTTTCCATGGTTTTTTTGTAAGCCAAACTTTAGAAGCTATGATTT 

TAGGAAGTTrGACGGTTATTGGTATGTTGATCTTCCAATTTCCTTATGCTTT 

AACAGTTGGGGTTTTAGTTGCTTTTACAGCTCTAATACCGGTTGTGGGAGC 

CTACATTGGTGTTACAATCGGTTTCATCTTAATTGCTACTGAATCGCTTACT 

G AAG C ATTCTTGTTTGTTCTTTTCTTG ATCC 1" 1 ' 11 ACAACAATTTGAGGGAA 

ATGTCATTTATCCGAAAGTTGTCGGTGGATCGATTGGACTGCCTTCTATGT 

GGGTTTTAATGGCTATTACTATCGGAGGTGCTTTATGGGGGATCTTAGGCA 



FIG. 1 CONTD 



TGTTACTTGCTGTTCCTGTTGCAGCTACTATCTATCAGATTGTAAAAGATCA 
TATTATCAAGCGACAAACGCTTAGAAATCGTGCACGAACCTATCGTTAA 

MKFEKRQVYYWITFAICYAIQAYWGAVSNILTTLHKAIFPFLMGAGIAYIINI 

VMSVYERLYIKLFKGSRLLMA1KRSVSMILSYATFIGLIVWLFSIVIPDLISSLSS 

LLVIDTGALAKLVNNLNENKQISEALNYMGTDKDLVSTLSGYSQQILKQVLSV 

LTNLLTSVSSIAATLLNVFVSFIFSIYVLANKEQLGRQFNLL1DTYLGSTGKTFH 

YVRH1LHQRFHGFFVSQTLEAMILGSLTVIGMLIFQFPYALTVGVLVAFTALIP 

WGAYIGVTIGFILIATESLTEAFLFVLFLILLQQFEGNVIYPKVVGGSIGLPSM 

WVLMAITIGGALWGILGMLLAVPVAATIYQIVKDHIIKRQTLRNRARTYR* 



Sequence description: 

A] Length: 11 34 bp - 378 aa (full-length gene) 

B] ATG start codon is preceded by an typical 
Shine-Dai garno sequence. Possesses a potential 
leader peptide sequence. 



ID-121 

Identical to ID-68, as described in WO 00/06736 



ID- 122 



Clone 3-16 



GTGATTACAATTAAAAAGGAATCTGTTATCAAACTATTGAAGTATGCTTTT 

GGCATTATAATGGGATTTATTATCTTAGCTATTGTAATAGGTGGGCTCCTA 

TTTGCATACTACGTTAGTCGTTCTCCGAAATTAACCGATCAAGCTTTAAAA 

TCCGTTAACTCTAGTTTGGTTTATGATGGTAATAATAAACTTATTGCCGATT 

TAGGCTCAGAAAAGCGTGAAAGTGTTAGTGCGGATAGCATTCCACTAAAT 

TrGGTTAACGCTATCACTTCTATAGAAGATAAACGTTTCTTTAAACATAGA 

GGTGTCGATATTTATCGTATTTTAGGTGCAGCTTGGCATAACCTTGTTAGTA 

GTAATACGCAAGGTGGTTCAACCCTTGATCAACAGTTGATTAAACTGGCTT 

ACTTTTCTACCAATAAATCTGACCAAACGTTAAAACGTAAATCACAGGAA 

GTTTGGCTTGCGCTTCAAATGGAGCGTAAATACACCAAAGAAGAAATTCTT 

ACTTTCTATATTAATAAAGTTTATATGGGAAATGGGAATTATGGTATGAGA 



FIG. 1 CONTD 



ACAACAGCTAAATCATACTTTGGTAAAGACCTAAAGGAATTATCTATTGCA 

CAACTTGCTTTGCTCGCTGGTATTCCTCAAGCACCTACACAATATGACCCTT 

ATAAAAACCCAGAATCTGCTCAAACAAGACGTAATACCGTTCTTCAGCAG 

ATGTATCAAGATAAAAACATTTCTAAAAAGGAATACGACCAAGCTGTTGC 

AACTCCAGTAACTGATGGCTTAAAAGAATTAAAGCAAAAATCTACTTATCC 

AAAATATATGGATAACTACTTAAAACAAGTTATTAGTGAAGTTAAACAAA 

AAACTGGTAAAGATATCTTTACTGCTGGGCTAAAAGTGTATACTAATATCA 

ACACTGATGCACAAAAACAACTATATGACATCTACAACAGTGATACTTAC 

ATCGCTTATCCAAACAATGAATTACAAATAGCATCTACCATCATGGATGCG 

ACTAATGGTAAAGTCATTGCACAATTAGGCGGGCGTCATCAGAATGAAAA 

TATTTCATTTGGGACAAATCAATCTGTCTTAACAGACCGCGATTGGGGTTC 

TACAATGAAACCTATCTCAGCTTATGCACCTGCTATTGATAGTGGTGTCTA 

TAATTCAACAGGTCAATCATTAAACGACTCAGTTTACTACTGGCCTGGTAC 

TTCTACTCAACTATATGACTGGGATCGTCAATATATGGGTTGGATGAGTAT 

GCAGACCGCTATTCAACAATCACGTAACGTCCCTGCTGTCAGAGCACTTGA 

AGCCGCTGGATTAGACGAAGCAAAATCTTTCCTTGAAAAATTAGGCATAT 

ACTATC C AG AAATG 

MIT1KKESVIKLLKYAFGIIMGFI1LAIVIGGLLFAYYVSRSPKLTDQALKSVNSS 

LVYDGNNKLIADLGSEKRESVSADSIPLNLVNAITSIEDKRPFKHRGVDIYRILG 

AAWHNLVSSNTQGGSTLDQQLIKLAYFSTNKSDQTLKRKSQEVWLALQMER 

KYTKEEILTFYINKVYMGNGNYGMRTTAKSYFGKDLKELSIAQLALLAGIPQA 

PTQYDPYKNPESAQTRRNTVLQQMYQDKNISKKEYDQAVATPVTDGLKELK 

QKSTYPKYMDNYLKQVISEVKQKTGKDIFTAGLKVYTNINTDAQKQLYDIYN 

SDTYIAYPNhfELQIASTIMDATNGKVIAQLGGRHQNENISFGTNQSVLTDRDW 

GSTMKPISAYAPAIDSGVYNSTGQSLNDSVYYWPGTSTQLYDWDRQYMGWM 

SMQTAIQQSRNVPAVRALEAAGLDEAKSFLEKLGIYYPEM 



Sequence description: 

A] Length: 1386 bp - 462 aa (partial sequence) 

B] GTG start codon is preceded by an 
typical Shine-Dalgamo sequence. Possesses a 
potential leader peptide sequence. 



ID- 123 



Clone 3-17 



FIG. 1 CONTD 
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ATGGCTAATGTATATGATTTAGCAAATGAATTAGAACGTGCTGTTCGTGCT 

TTACCAGAATACCAAGCAGTTTTAACTG CAAAA GCAGCTATTGAAAATGA 

TGCGGATGCACAAGTGCTTTGGCAAGACTTTTTGGCTACCCAATCAAAAGT 

TCAAGAAATGATGCAATCTGGCCAAATGCCAAGTCAAG AAGAA CAAGATG 

AAATGTCTAAACTTGGGG AAAAA ATTGAATCC AATGACC I i I 1 AAAAGTTT 

ATTTTGACCAACAACAACGGTTGTCTGTCTATATGTCTGATATCGAAAAAA 

TTGTCTTTGCACCCATGCAGGACTTGATGTAA 

MANVYDLANELERAVRALPEYQAVLTAKAAIENDADAQVLWQDFLATQSK 

VQEMMQSGQMPSQEEQDEMSKLGEKIESNDLLKVYFDQQQRLSVYMSDIEK1 

VFAPMQDLM* 



Sequence description: 

A] Length: 336 bp - 1 12 aa (full length sequence) 

B] ATG start codon is preceded by an 
typical Shine-Dalgarno sequence. No obvious 
potential leader peptide sequence. 



ID-124 



Clone 3-26 



ATGGCAGAAATCACAGCTAAACTTGTAAAAGAATTGCGTGAAAAATCAGG 
TGCAGGCGTTATGGACGCTAAAAAAGCATTAGTAGAAACTGATGGTGACC 
TTG ATAAA G CG ATTG AATTACTTC G CG AAAAAG GT ATGG CT AA AG C AG CT 
AAAAAAG C AGAC CGTGTTG CTGCTG AAGGTTTAACAGGTGTTTATGTTG AT 
GGTAACGTTGCAGCAGTTATTGAAGTTAA 

MAEITAKLVK£LPJEKSGAGVMDAKKALVETDGDLDKAIELLREKGMAKAAK 
KADRVAAEGLTGVYVDGNVAAVIEV 



Sequence description: 

A] Length: 230 bp - 76 aa (partial sequence) 

B] ATG start codon is preceded by an 
typical Shine-Dalgamo sequence. No obvious 
potential leader peptide sequence. 



FIG. 1 CONTD 
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ID- 125 



Clone 3-33 



ATGATAAAAAACCTGTTATTAACAGGTTTTTTATCATTTAATGACGGAAAA 
CTGGACACAAATTATTTTTCTTGTATAATTAAATATATTATTTCTTATCAGG 
AGGTTATGATGACATTAGAGAAACGATTTAA 

MIKNLLLTGFLSFNDGKLDTNYFSCIIKYIISYQEVMMTLEKRF 



Sequence description: 

A] Length: 1 34 bp - 44 aa (partial sequence) 

B] ATG start codon is preceded by an 
typical Shine-Dalgarno sequence. Possible 
potential leader peptide sequence. 



ID-126 



Clone 3-41 



ATG AAAAATAAT A AAAAT AATG GTTTTCTG AAA AATTC CTTT ATTTAC ATA 
TTATTGATTATTGCGGTTATTACAACCTTTCAATACTATTTAA 

MKNNKNNGFLKNSF1YILLIIAVITTFQYYL 



Sequence description: 

A] Length: 94 bp - 31 aa (partial sequence) 

B] ATG start codon is preceded by a 
possible Shine-Daigarno sequence. Potential 
leader peptide sequence. 
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ID-127 



Clone 3-42 



ATGTTAGATATTATCTTATCCGGAATTTCGCAAGGATTACTTTGGTCAATTA 
TGGCAATTGGCGTGTTTATCACTTTTCGTATCTTAGACATAGCCGATCTCTC 
TGCAGAAGGGGCTTTCCCTATGGGGGCTGCAGTTTGCGCCTTATGTATCGT 

TAA 

MLDIILSGISQGLLWSIMAIGVF1TFRILDIADLSAEGAFPMGAAVCALCIV 



Sequence description: 

A] Length: 158 bp - 52 aa (partial sequence) 

B] ATG start codon is preceded by a 
possible Shine-Dalgamo sequence. Potential 
leader peptide sequence. 



ID-128 



Clone 3-43 



ATGGAAATGCCTAAAAGAAATGAATTACTCAATAAAGAAATTAAAATGAG 
TATTGATAAACTTAGATATAAAGAACCAGAGAGTGAACATGACAAGCGAC 
CTACTTTTTATTTGGTAGTACTTATACTTGTTACTGTAGCAGTTATATTGTC 

GTTATTTAA 

MEMPKRNELLNKEIKMSIDKLRYKEPESEHDKRPTFYLVVLILVTVAVILSLF 



Sequence description: 

A] Length: 161 bp - 53 aa (full-length gene) 

B] ATG start codon is preceded by a 
possible Shine-Dalgamo sequence. Potential 
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leader peptide sequence. 



ID- 129 



Clone 3-44 



GTGGTAAGTAAATTGAGTTTAACAACGATTTTTGC 

ATG CT AATTTACG C AAC AC CTCTT ATCTTTAC AAGT ATTG G G GG AAC CTTC 

tctg^gtg^^^^ 

GGAGCTTTCTCAGGCGTTGTATTTAA 

MVSKLSL1TIFALLFSSMLIYATPLIFTS1GGTFSERGGIVNVGLEGIMVIGAFSG 
WF 



Sequence description: 



A] Length: 179 bp - 59 aa (partial sequence) 

B] GTG start codon is preceded by a 
possible Shine-Dalgarno sequence. Potential 
leader peptide sequence. 



ID-130 
Clone 3-46/47 

ATGAGAATTATTGCAATAACTGAAAAGGTTATy\AAA 

AAAAGAACACTTGCTATGATGTTTTTAGCACCTATTTTAATTATGTTTTTGA. 

tt a a rr.tta ac acg aaggtcgtttcaaatttag ataatattaagc atattc 
aIotga^ca^ 

ataaaattgatgctcttatttcggaggacaataaatcttatactgtcttct 

atg^tgaatacagattcttcaaagacgactttaacaagaca^ 
accgctgttaa^caatgaacagtaaggaactgatttcgcaagttaaaatt 

tta^c^aataagaatccgaaacta 

TATATCAAAGAAAAATATAATTACGGAAATAAAAATACAGGCTT^ 
AAA^3n^3ATACCAATACTAATGGGAT 
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MRIIAITEKVIKELFRDKRTLAMMFLAPILIMFLMNVMFSANSNTKVKIGTFNV 
NTKWSNLDN1KHIQVRSFKFNSSAKKALKSNKIDALISEDNKSYTVFYANTDS 
SKTTLTRQAFKTAVNTMNSKELISQVKILANKNPKLAQSLQTRSKYIKEKYNY 
GNKNTGFFAKMIPILMGFMVFFLVF 



Sequence description: 

A] Length: 558 bp - 186 aa (partial sequence) 

B] ATG start codon is preceded by a 
possible Shine-Dalgarno sequence. Potential 
leader peptide sequence. C-terminus has yet to be 
determined. 



ID-13I 



Clone 3-48 



GTG ATTATC GTT ATG AGT AAAC ATC AAG AAATTTTG G A GTA CCTAG AA A AT 
TTAGCTGTTGGTAAGAGGGTTAGTGTACGCAGTATTTCAAATCATTTAA 

MIIVMSKHQEILEYLENLAVGKRVSVRSISNHL 



Sequence description: 

A] Length: 100 bp - 33 aa (partial sequence) 

B] GTG start codon is not preceded by a 
obvious Shine-Dalgamo sequence. No obvious 
leader peptide sequence. 



ID-132 



Clone 2-c53 
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ATGTATAGAGAAATTACCGCTGTCGAACACGATCGCTTTGTGAGCGAATCC 

AACCAAACAAACCTACTTCAATCTCTTAATTGGCCCAAAGTAAAAGACAA 

CTGGGGTAGTCAATTACTTGGCTTTTTTGACGGTGAAACCCAAATTGCCAG 

CGCTAGTATTCTCATCAAATCACTTCCTCTTGGCTTCTCCATGCTGTATATT 

CCGCGTGGACCAATCATGGATTACTCCAATCTAGATATTGTAACTAAGGTC 

CTTAAGGACCTTAAAGCTTTTGGCAAAAAACAAAGAGCTCTCTTTATCAAG 

TGTGATCCTCTCATCTATTT 

MYREITAVEHDRJFVSESNQTNLLQSLNWPKVKDNWGSQLLGFFDGETQIASA 
SILIKSLPLGFSMLYIPRGPIMDYSNLDIVTKVLKDLKAFGKKQRALFIKCDPLI 
Y 



Sequence description: 



A] Length: 326 bp - 108 aa (partial sequence) 
BJ ATG start codon is preceded by an obvious 
Shine-Dalgamo sequence. No obvious leader 
peptide sequence. 



ID- 133 



Clone 2-c59 



ATG G A C AAG AAAAAA ATCTT AGT AACGG GTATTGTG C CTAAA G AA G GTC T 

AAGAAAGCTTATGGACCGATTTGATGTTACTTATTCAGAAGATCGCCCATT 

TTCACGTGACTATGTGTTAGAGCATTTATCTGAATATGACGGATGGTTACT 

CATGGGACAAAAAGGTGATAAAGAGATGATTGATGCAGGTGAAAACTTAC 

AAATTATTTCTTT 

MDKKKILVTGIVPKEGLRKLMDRFDVTYSEDRPFSRDYVLEHLSEYDGWLLM 
GQKGDKEMIDAGENLQIIS 



Sequence description: 



A] Length: 215 bp - 71 aa (partial sequence) 
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B] ATG start codon is preceded by an obvious 
Shine-Dalgarno sequence. No obvious leader 
peptide sequence. 



ID-134 



Clone 2-c62 



ATTTCGAAAGATGACTACCAAAATATTAGTTTTGGACAGGATCCAGAAGTT 

GTTGATTATGCTGGTCTGTTTGAAAAACGCCGTCCAGTTTTAGAAAAAGCA 

GTTAAAAATTTCTTGCAAGAAGAGAGAGCTACGAGAATGCTATCTGATTTC 

TTGCAA GAAG AAAAATGGGTAACTGATTTTGCTGAATTTATGGCGATCAA 

AGAACA riTlGGT AATAAGGCGCTTCAAGAATGGGATGACAAGGCTATTA 

TACGCCGCGAAGAAGAAGCCTTAGCAGGATATCGTCAAAAGCTTAGTGAA 

GTGATAAAATATCATGAAGTAACGCAATATTTCTTTTACAAACAATGGTTT 

GAGTTAAAAGAATATGCTAATGATAAAGGGATTCAAATTATCGGTGATAT 

GCCAATCTACGTTTCTGCCGATAGTGTAGAAGTTTGGACAATGCCTGAACT 

GTTT 

ISKDDYQNISFGQDPEVVDYAGLFEKRRPVLEKAVKNFLQEERATRMLSDFLQ 

EEKWVTDFAEFMAIKEHFGNKALQEWDDKA1IRREEEALAGYRQKLSEVIKY 

HEVTQYFFYKQWFELKEYANDKGIQIIGDMPIYVSADSVEVWTMPELF 



A] Length: 459 bp - 153 aa (partial sequence) 

B] More sequencing is required to determine the 
N- and C-termini 

enzyme). - Streptococcus pneumoniae (63%) 

ID- 135 

Identical to ID- 108 described in WO 00/06736 



Clone 2-c63 
ID-136 



Clone 2-c66 
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ATGGCAAAACAGAAAAATAACTGGCGCCGTGTTGGAGTTGGTGTCCTTAC 

ACTTGCTTCAGTTGCGACTCTTGCTGCATGTGGAAGTAAATCAGCTTCCCA 

GGATTCTAATGGAGCGATTAATTGGGCTATTCCAACAGAAATCAATACACT 

AGATTTATCTAAAGTTACAGACACTTACTCAAATCTAGCTATTGGTAACTC 

TAGTAGTAATTTCCTTCGCTTAGATAAAGATGGAAAGACAAGACCAGACTT 

GGCTACTAAAGTTGATGTTTCAAAAGATGGCTTAACTTATACAGCTACATT 

ACGTAAAGGCTTGAAGTGGTCAGATGGCAGTAAACTTACTGCAAAGGATT 

TTGTTTATTCATGGCAACGTTTAGTTGATCCTAAAACAGCTTCACAATATG 

CTTACCTTGCTGTTGAAGGGCATGTGCTTAATGCCGATAAAATCAACGAAG 

GACAAGAGAAAGACTTGAATAAGCTAGGTGTTAAGGCAGAAGGCGATGA 

CAAAGTTGTTATTACTTTATCTAGTCCGTCTCCGCAATTCATCTACTACCTT 

GCATTCACTAACTTCATGCCACAAAAACAAGAAGTTGTTGAAAAATATGG 

AAAAGATTACGCAACTACTTCAAAAAATACAGTTTACTCAGGACCATATA 

CTGTTGAAGGTTGGAATGGTTCGAATGGTACTTTCACGCTGAAGAAAAAC 

AAAAATTATTGGGACGCTAAAAATGTAAAAACAAAAGAAGTTCGCATCCA 

GACTGTTAAAAAACCAGATACCGCCGTTCAAATGTATAAACGTGGTGAGT 

TAGATGCAGCTAATATCTCAAATACTTCTGCTATTTATCAAGCTAATAAAA 

ATAA TAAA GATGTC AC AG ATGTTCTAG AAG CG ACC ACTGC CT AT ATGG AA 

TATAATACTACTGGTTCTGTGAAAGGGCTTGATAATGTTAAGATTCGTCGC 

GCCTTAAACTTAGCAACTAACCGTAAAGGAGTTGTTCAAGCAGCCGTTGAT 

ACAGGCTCAAAACCGGCAATTGCTTTTGCACCTACTGGTTTAGCCAAAACA 

CCAGATGGAACTGATTTGGCAAAATATGTTGCCCCAGGTTATGAATATAAT 

AAAACTGAAGCAGCAAAACTCTTTAGACTA 

MAKQKNNWRRVGVGVLTLASVATLAACGSKSASQDSNGAINWAIPTEINTLD 

LSKVTDTYSNLAIGNSSSNFLRLDKDGKTRPDLATKVDVSKDGLTYTATLRKG 

LKWSDGSKLTAKDFVYSWQRLVDPKTASQYAYLAVEGHVLNADKINEGQEK 

DLNKLGVKAEGDDKVVITLSSPSPQFIYYLAFTNFMPQKQEVVEKYGKDYAT 

TSKNTVYSGPYTVEGWNGSNGTFTLKJCNKNYWDAKNVKTKEVRJQTVKKPD 

TAVQMYKRGELDAANISNTSAIYQANKNNKDVTDVLEATTAYMEYNTTGSV 

KGLDNVKIRRALNLATNRKGVVQAAVDTGSKPAIAFAPTGLAKTPDGTDLAK 

YVAPGYEYNKTEAAKLFRL 



Sequence description: 



A] Length: 1 143 bp - 381 aa (partial sequence) 

B] Shine-Dalgamo sequence precedes ATG codon. 
Possesses a potential leader peptide sequence. 
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ID-137 



Clone 2-c67 



TTGAGAGTTTATGAAAATAAAGAAGAGTTGAAAAAAGAAATAAGTAAAAC 
ATTTGAGAAATACATTATGGAATTTAATAA 

TATTCCAGAGAATCTAAAAGATAAAAGAATTGATGAAGTTGATAGAACTC 
CAGCAGAAAACCTTTCTTATCAGGTTGGCT 

GGACCAACTTGGTTCTTAAATGGGAAGAAGATGAAAGAAAGGGACTTCAA 
GTAAAAACACCATCGGATAAATTT 

MRVYENKEELKKEISKTFEKYIMEFNN1PENLKDKRIDEVDRTPAENLSYQVG 
WTNLVLKWEEDERKGLQVKTPSDKF 



Sequence description 

A] Length: 234 bp - 78 aa (partial sequence) 

B] TTG start codon is preceded by a 
potential Shine-Dai gamo sequence. No obvious 
leader peptide sequence. 



ID-138 



Clone 2-c70 



ATGTCAAAGTTTGATAGTCAGAAAATAATTACTCCGATTATGAAGTTTGTC 
AATATGCGAGGGATTATTGCACTCAAAGATGGCATGCTAGCAATTTTACCA 
CTAACAGTTGTTGGGAGTCTCTTTTTAATATTAGGGCAGCTTCCATTT 

MSKFDSQKIITPIMKFVNMRGIIALKDGMLAILPLTVVGSLFLILGQLPF 



Sequence description 

A] Length: 150 bp - 50 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
Shine-Dalgamo sequence. Possesses a potential 
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leader peptide sequence. 



ID-139 



Clone 2-c71 



GAGACCACTTCATCAGTTAAACCAGCAGGAATTGACCGTATCAATCATACC 

TCAACACCCCCGAAGAAAACTACCCCCAACATTGCAACGACGCATAGCTT 

CAAAGATCGTTGTGATACTTTAGAAAGAATTCACAATGAAGACATTGATGT 

TTGTTCTGGATTCATTTGTGGTATGGGAGAGAGCGATGAGGGGCTCATCAC 

ATTAGCTTTCAGACTAAAAGAACTGAACCCCTATTCTATCCCTGTCAATTTT 

TTACTTGCTGTTGAAGGAACACCTCTTGGAAAATATAACTATTTGACTCCC 

ATTAAATGCTTAAAAATTATGGCCATGTTGCG11 TTGTTTTTCCTTTCAAGG 

AATTAAGATTAAGTGCTGGACGGGAGGTCCATTTTGAGAATTTTGAATCAT 

TAGTCACCTTACTTGTTGACTCAAC 1T1 TJ 1 GGGAAATTACCTAACAGAGG 

GGGGTCGCAATCAACATACCGATATTGAATTCTTGGAAAAATTACAACTA 

AATCATACTAAAAAGGAATTAATTT 



ETTSSVKPAGIDRINHTSTPPKKTTPNIATTHSFKDRCDTLER1HNEDIDVCSGFI 
CGMGESDEGLITLAFRLKELNPYSIPVNFLLAVEGTPLGKYNYLTPIKCLKIMA 
MLRFVFPFKELRLSAGREVHFENFESLVTLLVDSTFLGNYLTEGGRNQHTDIEF 
LEKLQLNHTKKELI 



Sequence description: 



A] Length: 535 bp - 178 aa (partial sequence) 
BJ N- and C-termini require verification 



1D-140 



Clone 2-c73 



ATGCCG GTTTG G ACTG C AC AGTCTATTCC AAAGG C ATTTTTAG AAAAG CAT 
AATACTAAGGAAGGCACCTGGGCAAAACTAACCATTCTAAGTGGTTCTTTA 
GTATTTTACCAGTTATCTCCTGATGGAGAGGAAATCTCGCGGCATATTTTT 
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GATGCTAGTAGTGATATTCCTTTTGTTGATCCACAAGTCTGGCATAAAGTT 

TCGCCGAATAGTCCAGACTTAAGTTGCTATCTAACTTTTTACTGCCAAAAA 

GAAGATTACTTCCATAAAAAATATGGTCTCACGCGCACACATTCTGAGGTT 

ATCGCCAGTGCACCTCTCTTATCTGAGAAGAGTAATATATTAGACCTTGGG 

TGTGGTCAAGGGCGAAACTCACTTTATTTATCGCTGCTGGGACATCAAGTG 

ACTTCTGTCGATTCAAACGGACAGAGCCTTGTAGCTTTAGAAAATATGGCA 

TTAGAAGAAGAGCTTCCTTACAATATAAAAAGGTATGATATTAATACTACT 

G CTATTG AA G G G C A C T ATG ATTTT ATTTT ATC AACTGTG GT ATTTA TGTTTT 

T 

MPVWTAQSIPKAFLEKHNTKEGTWAKLTILSGSLVFYQLSPDGEEISRHIFDAS 
SDIPFVDPQVWHKVSPNSPDLSCYLTFYCQKEDYFHKKYGLTRTHSEVIASAP 
LLSEKSNILDLGCGQGRNSLYLSLLGHQVTSVDSNGQSLVALENMALEEELPY 
NIKRYDINTTAIEGHYDFILSTVVFMF 



Sequence description: 



A] Length: 563 bp - 187 aa (partial sequence) 

B] N- and C-termini require verification 



ID-141 



Clone 2c76 



ATGACAAAGCAAATAATTGCCATTTGGGCTGAAGATGAAGACCATTTGAT 

TGGAGTTAATGGCGGTTTACCATGGAGGCTTCCTAAAGAGTTACATCACTT 

CAAAGAAACGACCATGGGGCAGGCTTTGCTTATGGGACGAAAGACCTTTG 

ATGGAATGAACCGTCGTGTTTTACCTGGTAGAGAGACAATCATCTTAACAA 

AAGATGAACAATTCCAAGCAGATGGAGTGACAGTCCTAAATAGTGTTGAA 

CAAGTTATAAAATGGTTTCAGGAACATAATAAGACCTTATTTATTGTAGGT 

GGTGCAAGTATTTATAAAGCATTTCTGCCTTATTGTGAAGCAATCATAAAA 

ACTAAAGTTCATGGAAAATTCAAAGGTGATACCTATTTTCCTGATGTTAAT 

CTATCTGAGTTT 

MTKQnAIWAEDEDHLIGVNGGLPWRLPKELHHFTCETTMGQALLMGRKTFDG 

MNRRVLPGRETIILTKDEQFQADGVTVLNSVEQVIKWFQEHNKTLFIVGGASI 

YKAFLPYCEAIIKTKVHGKFKGDTYFPDVNLSEF 
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Sequence description: 



A] Length: 41 7 bp - 139 aa (partial sequence) 

B] ATG start codon is preceded by a Shine- 
Dalgarno sequence. No leader peptide sequence 



ID- 142 



Clone 2-c78 



TTGTGGCCAAACTGTGCCCCGCTTATTAATAGCACTTTGTTCACCATTGAA 

GATATCTTAACATCAGGTGCTCATAGCAACCCTATTTTAATGGGGGTTATA 

CTTGGCGGGACAATTGTAGTAGTGGCGACAGCACCACTTTCTTCTATGGCA 

TTGACAGCTATGCTAGGATTAACCGGAATGCCTATGGCTATAGGAGCCTTG 

TCTGTCTTTGGTTCGTCATTTATGAATGGTGTACTTTTCCATAAATTAAAAC 

TTGGAAGTCGTAAAGATAATATAGCTTTTGCTGTTGAGCCTCTAACTCAAG 

CTGACGTGACTTCAGCTAACCCTATTCCAATCTATGTCACTAATTTTGTTGG 

TGGTGCAGCTTGTGGTATTTTAATTGCCTTGATGAAATTAGTTAATGATACT 

CCTGGAACAGCGACACCAATTGCAGGATTTGCTGTCATGTTTGCCTATAAC 

CCAATGATAAAAGTACTAATAACCGCTCTAGGTTGTATTATCCTATCTTTA 

CTAGCAGGCTATTTTGGAGGCATTGTTTTT 

MWPNCAPLINSTLFT1ED1LTSGAHSNPILMGVILGGTIVVVATAPLSSMALTA 
MLGLTGMPMAIG ALS V FG SSFMNG VLFHKLKLG SRKDN 1 A F A VEPLTQ ADV T 
SANPIPIYVTNFVGGAACGILIALMKLVNDTPGTATP1AGFAVMFAYNPMIKVL 
ITALGCIILSLLAGYFGGIVF 



Sequence description: 



A] Length: 540 bp - 180 aa (partial sequence) 

B] N- and C-termini have yet to be elucidated 



ID- 143 
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Clone 2-c80 



ATG 1 1111 AAGTATAATGGCAGGTGTCATAGCATTTGTCCTGACAGTT ATT . 
GCCATTCCACGCTTCATTAAGTTTTACCAATTGAAGAAAATTGGCGGGCAA 
CAAATGCATGAAGATGTCAAACAACATCTAGCCAAAGCAGGTACGCCGAC 
AATGGGAGGAACGGTATTTT 

MFLS IM AG VI AFV LTVI AIPRFIKFYQLKKIGGQQMHED VKQHLAKAGTPTMG 
GTVF 



Sequence description: 



A] Length: 172 bp - 57 aa (partial sequence) 

B) Shine Dalgarno sequence precedes 'ATG* start 
codon. Possesses a potential leader peptide 
sequence. 



ID- 144 



Clone 3-83 



ATGAAACCATATTTATCTTTTATTGGTAGAACGTTATTATACTTCGGTATTT 
TATTGTTACTAATTTAC I TTTTI GCATACCTTGGTCGCGGACAAGGCAGTTT 
TATTTATAA 

MKPYLSFIGRTLLYFGILLLLIYFFAYLGRGQGSFIY 



Sequence description: 

A] Length: 1 1 3 bp - 37 aa (partial sequence) 

B] Putative ATG start codon is preceded by a 
typical Shine-Dai garno sequence. Possesses a 
potential leader peptide sequence. 

This orf is not in frame with nuc 
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ID-145 



Clone 3-86 



ATGTCATATTTTAGAAATTACTGGTATCGTTTTGGAGCAATTTTATTTATTA 

TTTTAGCAGTAATATTGCTTGTTTTTAGACCTGACTGGTCAATGCTTCACTA 

TCTATTGTA VI i i i ACTTTATGGCACTTCTAGCGCATCAATTTGAAG AATAT 

CAGTTTCCCGGTGGGGCATCACCTATCATTAACTATGTTGTTTATGATGAA 

GAAGAGCTGATGGATTGTTTTCCAGGCAATACTCAGTCTATTATGTTGGTT 

AATACTATTGCTTGGTTGCTTTACATTGCTAGTATTGCTTTTCCTCAAGCTT 

ATTGGCTTGGATTAGGAGTCATGTTCTTTAGTCTAACGCAGCTCTTGGGTC 

ATGGTTTTCAGATGAATATTAAACTTAAAACTTGGTATAATCCTGGTCTAG 

CAACGACAGTATTTCTCCTAGTACCAATAGCTTGCGCATACATCTATCAAG 

CTAGTGCAGAAGGAATGCTCACTTGGGGAGATTGGCTAGGTGGTTTTATCA 

TGTTGATTGTCTGTGTACTAACTAGCATTATTGCACCTGTACAGCTATTGAA 

GGATAAGGAGACCAATTATATTATTAGTCCTTGGCAAATGGACCGTTTTCA 

TAAGGTCGTTAATTTTGTAAGGATAAAAAAATAA 

MSYFRNYWYRFGAILFIILAVILLVFRPDWSMLHYLLYFYFMALLAHQFEEYQ 

FPGGASPIINYWYDEEELMDCFPGNTQSIMLVNTIAWLLYIASIAFPQAYWLG 

LGVMFFSLTQLLGHGFQMNIKLKTWYNPGLATTVFLLVPIACAYIYQASAEG 

MLTWGDWLGGFIMLrVCVLTSIIAPVQLLKDKETNYIISPWQMDRFHKVVNFV 

RIKK* 



Sequence description: 



A] Length: 651 bp - 219 aa (full length gene) 

B] Putative ATG start codon is preceded by a 
typical Shine-Dalgamo sequence. Possesses a 
potential leader peptide sequence. 



ID-146 



Clone 3-c88 
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ATGCCACTTACAGCACTTGAAATTAAAGATAAAACATTTTCATCAAAATTT 
CGCGGTTATAGCGAAGAAGAAGTT 

MPLTALEIKDKTFSSKFRGYSEEEV 



Sequence description: 

A] Length: 75 bp - 25 aa (partial sequence) 

B] Putative ATG start codon is preceded by a 
typical Shine-Dalgarno sequence. No leader 
peptide 



ID-147 



Clone 3-90 



ATGTCACTTTTTCAAGAAAAAATTGCTTACAATTGCGCTAAAAAGGAAGCG 

CTTTATAAAGAGAGTTTAGGACGCTACGCCTTGAGATCAATGCTAGCAGG 

GGCTTATTTGACAATGAGTACTGCTGCCGGTATCGTCGCAGCTGATACTAT 

TGGTAAAATTTCTCCTGCTCTATCAGGTTTTGTATTTGCTTTCATCTTTAGTT 

TTGGACTTATTTATGTTTTAATATTTAATGGTGAATTGGCGACATCTAATAT 

GCTTTATCTCACTGCAGGAGCCTATAATAAAAATATCTCTTGGAAAAAAGC 

CATAACAATTTTAATTTATTGTACTTTTTTCAACCTCGTTGGTGCTTGTATA 

TTAG CTTGGTTGTTTAA 

MSLFQEKIAYNCAKKEALYKESLGRYALRSMLAGAYLTMSTAAGIVAADTIG 
KISPALSGFVFAFIFSFGLIYVLIFNGELATSNMLYLTAGAYNKNISWKKAITILI 
YCTFFNLVGACILAWLF 



Sequence description 

A] Length: 406 bp - 125 aa (partial sequence) 

B] Putative ATG start codon is preceded by a 
typical Shine-Dalgarno sequence. Possible 
leader peptide 
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ID-148 



Clone 3-92 



AAGTTACAAGCGACTGAAGTTAAGAGCGTTCCGGTAGCACAACCAGCTTC 

AACAACAAATGCAGTAGCTGCACATCCTGAAAATGCAGGGCTCCAACCTC 

ATGTTG C AGCTT ATAA AG AAAAAGTAG CGTC AACTTATG G AGTT AAT G AA 

TTCAGTACATACCGTGCGGGAGATCCAGGTGATCATGGTAAAGGTTTAGC 

AGTTGACTTTATTGTAGGTAAAAACCAAGCACTTGGTAATGAAGTTGCACA 

GTACTCTACACAAAATATGGCAGCAAATAACATTTCATATGTTATCTGGCA 

ACAAAAGTTTTATTCAAATACAAATAGTATTTATGGACCTGCTAATACTTG 

GAATGCAATGCCAGATCGTGGTGGCGTTACTGCCAACCACTATGACCACGT 

TCACGTATCATTTAA 

KLQATEVKSVPVAQPASTTNAVAAHPENAGLQPHVAAYKEKVASTYGVNEF 
STYRAGDPGDHGKGLAVDFIVGKNQALGNEVAQYSTQNMAANNISYVIWQQ 
KFYSNTNSIYGPANTWNAMPDRGGVTANHYDHVHVSF 



Sequence description 



A] Length: 419 bp - 139 aa (partial sequence) 
BJ N- and C-termini have yet to be determined 



ID- 149 



Clone 3-94 



ATGATTCCAGTAGTTATTGAACAAACAAGTCGTGGTGAACGTTCTTATGAT 

ATTTACTCACGTCTTTTAAAAGATCGTATTATTATGTTGACAGGCCAAGTT 

GAGGATAATATGGCCAATAGTATCATTGCACAGTTATTGTTTCTCGATGCA 

CAAGATAATACAAAGGATATTTACCTTTATGTCAATACACCAGGTGGTTCA 

GTATCGGCTGGACTTGCTATTGTGGACACCATGAACTTCATTAAATCGGAC 

GTACAGACGATTGTTATGGGGATGGCTGCTTCGATGGGAACCATTATTGCT 

TCAAGTGGTGCTAAAGGAAAACGTTTTATGTTACCGAATGCAGAATATATG 
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ATCCACCAACCAATGGGCGGAACAGGCGGAGGTACACAGCAATCTGATAT 

GGCTATCGCTGCTGAGCATCTTTTAAAAACGCGTCATACTTTAGAAAAAAT 

CTTAGCTGATAATTCTGGTCAATCTATTGAAAAAGTCCATGATGATGCAGA 

GCGTGATCGTTGGATGAGTGCTCAAGAACACTTGATTATGGCTTTATTGAT 

GCTATTATG GAAAA TAATAATTTACAATAATAGATTTAAAAGAGTTGAGTT 

TACCAACTCTTTTTTTATTTGTTGGAATTATGTTATAATCTTAGTAATTACA 

GATATGACGCAGAAAGGAAAAAATTATTGA 

MIPWIEQTSRGERSYDIYSRLLKDRIIMLTGQVEDNMANSIIAQLLFLDAQDN 
TKDIYLYVNTPGGSVSAGLA1VDTMNF1KSDVQTIVMGMAASMGTIIASSGAK 
GKRFMLPNAEYMIHQPMGGTGGGTQQSDMAIAAEHLLKTRHTLEKILADNSG 
QSIEKVHDDAERDRWMSAQEHLIMALLMLLWKIIIYNNRFKRVEFTNSFFICW 
NYVIIL VITDMTQKGKN Y * 



Sequence description 



A] Length: 693 bp - 231 aa (full length gene) 
B j Putative ATG start codon is preceded by a 
typical Shine-Dalgarno sequence. No leader 
peptide. Significantly, it would appear to have a 
very hydrophobic C-terminus. 



ID- 150 



Clone 2-c86 



ATG AAAC C AAAA aTT ATTG GTGTACTTGGTCT AG G AAT ATTTGG AC AAAC A 
CTCGCACAAGAACTAAGTAACTTTGAACAAGATGTTATTGCTATTGACAGC 
AATCCTGAAAATGTACAAGCTGTCGCCGAAGT 

TGTTACAAAAGCAGCTATCGGAGACATTACTGATTTAGCTTTCCTAAAACA 

CATCGGGATCAGTGACTGTGATACTGTTATTATTGCTACAGGAAACAGTTT 

AGAGAGCTCAGTATTGGCCGTAATGCACTGTAAAAAGTTAGGCGTCCCAC 

AAGTTATTGCTAAAGCTCGAAACCTTGTATACGAAGAAGTACTTTATGAAA 

TTGGTGCTGATTTGGTTATCTCTCCGGAGCGAGAATCTGGGCAAAATGTTG 

CTGCAAACCTCATGAGAAATAAAATTACAGATGTCTTCCAGATTGAATCTG 

ATATTTCTGTCATTGAATTT 

MKPKIIGVLGLGIFGQTLAQELSNFEQDVIAIDSNPENVQAVAEWTKAAIGDI 
TDLAFLKHIG IS DCDTV 1 1 ATGNSLE 
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SSVLAVMHCKKLGWQVIAKARNLVYEEVLYEIGADLVISPERESGQHVAAN 
LMRNKITD V FQIES DI S V IEF 



Sequence description: 

A] Length: 459 bp - 153 aa (partial sequence) 

B] Putative ATG start codon is preceded by a 
typical Shine-Dalgamo sequence. Possesses a 
potential leader peptide sequence. 

This orf is not in frame with nuc 



ID-151 



Clone 2-c88 



GTGCGTTATAGTAAAGAGATTATTCAGTTAGCTATACCAGCTATGATTGAA 

AATATCTTACAAATGCTCATGGGAGTAGTTGATAATTATCTAGTGGCTCAG 

TTAGGTGTTGTAGCAGTATCAGGTGTTTCAGTTGCTAATAATATAATTACT 

ATTTATCAAGCTA 1 1 TI 1 ATAGCTTTAGGGGCGAGTATAGCAAGTCTATTG 

GCCAAGTCGTTAGCAGGTAGTGAGAAGGATGATGCAATTTCAGTATGTTCT 

CAAGCCATTTTTCTAACATCACTGATAGGGGCAGTATTAGGAATTATCTCG 

ATTGTTTTTGGACAAACTTTCTTT 

MRYSKEIIQLAIPAMIENILQMLMGVVDNYLVAQLGVVAVSGVSVANNIIT1Y 
QAIFIALGASIASLLAKSLAGSEKDDAISVCSQAIFLTSLIGAVLGIISIVFGQTFF 



Sequence description 

A] Length: 330 bp - 1 10 aa (partial sequence) 

B] Putative GTG start codon is preceded by a 
typical Shine-Dalgamo sequence. May have a 
leader peptide 



ID-152 
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Clone 2-c92 



TTGATTAACAAGTATTCGTGCTTTTTGAAGAGGATTCTCCATAATAATACT 
CCTTTAATAGTTATCGTGAGAAGTATTTTAAAGAAAAACCGCCAAGGTAG 
AGCGACATTTCTGCCTTTAACTACAATAAAACCAAGAGAATTAGCACAAC 
ATTATCTCTCAAAATTACAAAGTTCTCAAGGG 1 1 I'l l AG G AATAG CTAGTG 
AATTGGTAACCTATGATCAACGCTTGTCAAACATTTTT 

MINKYSCFLKRILHNNTPLIVIVRSILKKNRQGRATFLPLTTIKPRELAQHYLSK 
LQSSQGFLGIASELVTYDQRLSNIF 

Sequence description 



A] Length: 240 bp - 80 aa (partial sequence) 

B] No obvious Shine Dalgamo sequence precedes the Putative TTG start 
codon 



ID-153 



Clone 2-c94 



TTGTTGACTCACAAAAATATATTATTAACCATTATATTTGGATTATTTATGA 

TTATATTATCAGCATGTGGTATGTCTAATAAGGAAATGGCTGGTATTGATA 

ATTGGGAACATTATCAAAAGGAAAAGAAAATTACTATTGGATTTGATAAT 

ACTTTTGTTCCTATGGGATTTGAAAGTCGTTCTGGTGACTATACCGGCTTTG 

ATATTGATTTAGCTAATGCTG1 111 1 AAAGAATACGGTATTTCAGTGAAAT 

GGCAGCCTATTAACTGGGATATGAAAGAAACTGAACTTAATAATGGTAAT 

ATAGACCTTATTTGGAATGGTTATTCAAAAACGGCAGAACGTGCTAAAAA 

AGTCGCTTTTACAAACCCATATATGAATAATCATCAAGTAATTGTTACTAA 

AACTTCATCACATATTAATAGTATTAAGGATATGAAGGGGAAAAAACTAG 

G AGCCCAGTCGGGTTC ATCTGG1 "1 11 G ATGCTTTT AACGCTAAAC CTG ATA 

TTTTAAAAAAGTTTGTAAAAGGAAAAGAAG CAGTTCA ATACG ATACTTTC 

ACTCAGGCTTTGATTGATTTAAAAAATAACCGTATTGATGGTCTTTTGATT 

GATGAAGTTTATGCTAACTATTATTTAAAGCAAGAAGGAA 
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MLTHKNILLTIIFGLFMHLSACGMSNKEMAGIDNWEHYQKEKKITIGFDNTFV 

PMGFESRSGDYTGFDIDLANAVFKEYGISVKWQPINWDMKETELNNGNIDL1 

WNGYSKTAERAKKVAFTNPYMNNHQVIVTKTSSHINSIKDMKGKKLGAQSG 

SSGFDAFNAKPDILKKFVKGKEAVQYDTFTQALIDLKNNRIDGLLIDEVYANY 

YLKQEG 



Sequence description 



A] Length: 649 bp - 216 aa (partial sequence) 

B] TTG start codon is preceded by a possible 
typical Shine-Dai gamo sequence. Has a 
leader peptide 



ID-154 



Clone 2-clOO 



ATGAAAATTTGGAAAAAAATAACCTTAATGTTTTCTGCAATTATTTTAACA 

ACAGTAATTGCATTGGGAGTCTATGTTGCCTCAGCTTATAATTTTTCGACTA 

ATGAATTGTCTAAGACTTTT 

MKIWKKITLMFSAIILTTVIALGVYVASAYNFSTNELSKTF 



Sequence description 



A] Length: 123 bp - 41 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgamo sequence. Has a 
typical leader peptide 



ID- 155 



Clone 2-cl 
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ATGAAAAAACAAAGACTATTACTGCTTTTTGGAGGCTTATTAATAATGATA 

ATGATGACAGCATGTAAGGATTCAAAAATCCCAGAAAACCGCACGAAAAA 

GGAATACCAGGCAGAACAGAATTTTAAGTCATACTTT AAATA TATATCAG 

ATAAAAATAACTATTTAGATAATATAAAAGTTTATTACTTTTCTATAAGTA 

TTTCTAAAGATGTACAAGATAAAGTCAGTGAAACAACAACTTGTTCATATA 

GACTAGAAAAGCAAAAGAATCAAGAGTTCATTGGTAATTTTGAACATGAA 

GTTAGTGAATCTAGTCAATATTCAACCGAAGTTAAAAATCAAATACAGTAT 

CCAATCCAGTATAAAGATAATTCAATTCGTTTTACTGAAAAAACACCGTCA 

GAACGTTATGATGAGTTTGTTTTTAGTTCATTTGATTCTTCATTATTAAAAA 

AATATAAAATATATGATTACTTACTAAAACATCCCGAAACTGA ATTA AAA 

GGTGTTTCCTATAAGATTCCTATAAATTCTGAAATTGTAGCCCCTTTTATAA 

ATCAATTAAATATAAAAAATCCTAAAAAATCATCTATTTCGGTTACAAAAA 

CGGAAAGTAAAGAATATTATTATACAATCAGTATTGATACTGATTCTGAGA 

TATATTCTATATTCGAAGGTATTCAT 

MKKQRLLLLFGGLLIMIMMTACKDSKIPENRTKKEYQAEQNFKSYFKYISDKN 

NYLDNIKVYYFSISISKDVQDKVSETTTCSYRLEKQKNQEFIGNFEHEVSESSQ 

YSTEVKNQIQYPIQYKDNSIRFTEKTPSERYDEFVFSSFDSSLLKKYKIYDYLLK 

HPETELKGVSYKIPINSEIVAPFINQLNIKNPKKSSISVTKTESKEYYYTISIDTDS 

EIYSIFEGIH 



Sequence description 



A] Length: 687 bp - 229 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dai garno sequence. Has a 
typical leader peptide. C-terminus has yet to be 
verified 



ID- 156 



Clone 2-c5 



ATGACATTTGACACCATTGATCAATTAGCGGTTAATACAGTCCGCACGCTT 
TCTATTGATGCTATCCAAGCAGCAAATTCTGGGCACCCAGGTCTTCCTATG 
GGAGCTGCGCCTATGGCTTATGTGCTTTGGAATAAATTCTTAAATGTAAAC 
CCAAAAACAAGTCGCAATTGGACAAACCGTGACCGTTTTGTACTTTCAGCT 
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GGGCATGGTTCAGCTCTTCTTTATAGCCTACTTCATTTAGCTGGCTATGATT 
TATCAATTGATGATTT 

MTFDTIDQLAVNTVRTLSIDAIQAANSGHPGLPMGAAPMAYVLWNKFLNVNP 
KTSRNWTNRDRFVLSAGHGSALLYSLLHLAGYDLSIDD 



Sequence description 



A] Length: 272 bp - 90 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgamo sequence. No obvious 
leader peptide 



ID-157 



Clone 2-c8 



ATGAGAACACTATTTAGAATGATATTTGCTATTCCAAAGTTTATCTTTAGA 
TTGATTTGGAATATCATTTGGGGAATATTCAAGACAGTTCTTGTTATTGCG 
ATTATTTTATTTGGCTTGTATTACTATGCGAATCACAGTCAATCAGAATTTG 
CTAATCAACTTAGTGACATTATTCAGACAGGAAAAACATTTTT 

MRTLFRMIFAIPKFIFRLIWNIIWGIFKTVLVIAIILFGLYYYANHSQSEFANQLS 
DI1QTGKTF 



Sequence description 



A] Length: 197 bp - 65 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgamo sequence. Possesses a 
leader peptide 



ID-158 
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Clone 2-c9 

ATGTCAAAAAAAATAATATTAGGAATTTTATCTCTTTTATCTGTCGTTACTT 
TGGTGGCGTGTGGTTCATCAGACAAACAGCTACAAGATAAAGTTGAGAAA 
AAAG G G AAGTTAGTTTTAG C G GTG AGTCC AG ATT ATG CTCC CTTTG AGTTT 

MSKKIILGILSLLSWTLVACGSSDKQLQDKVEKKGKLVLAVSPDYAPFEF 



Sequence description 

A] Length: 153 bp - 51 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-DaJgarno sequence. Possesses a 
leader peptide (not in frame with nuc) 



ID-159 



Clone 2-clO 



ATGAAAAATCAAAGACTATTACTGCTTTTTGGAGGCTTATTAATAATGATA 

ATGATGACAGCATGTAAGGATTCAAAAATCCCAGAAAACCGCACGAAAAA 

GGAATACCAGGCAGAACAGAATTTTAAGTCATACTTT 

MKNQRLLLLFGGLLIMIMMTACKDSKIPENRTKKEYQAEQNFKSYF 



Sequence description 

A] Length: 139 bp - 46 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgamo sequence. Possesses a 
leader peptide 
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ID- 160 



Clone 2-cl 1 



A TG AT TG G AAAA TTATA TTAT AG CTATA G AAAGTC ACG CTTATT AAG A AGT 

ATTTTATGGCTTATTTTAATTGTTGGTGTATATATGTTAGGACAACGTGTTT 

TATTATCCACTGTTCCTTTATCACATCAAGAGATAAAACTAGCAGTAGATC 

AACATT TACT CAATAACTTTTCAGCAGTAAGTGGTGGGAGTTTTAATAAAT 

TAAATGTTTTCACACTG GGGT TGAGTCCATGGATGTCAAGTATGATTATTT 

GGAGATTCGTTTCCTTATTTTCGTGGGCAAAAAATGCAACGAAGCGAAAA 

GCAGAAGTAGCTCAATATACTTTAATGCTTACTATCTCAGTTATACAAGCA 

TATGGTGTTTCAGGAAATCAATTTATAAAAAGCTCTTTATTAGGTTCTTATA 

GTGATATTGTTTTT 

MIGKLYYSYRKSRLLRSILWLILIVGVYMLGQRVLLSTVPLSHQEIKLAVDQHL 

LhnvfFSAVSGGSFNKLNVFTLGLSPWMSSMIIWRFVSLFSWAKNATKRKAEVA 

QYTLMLTISVIQAYGVSGNQFIKSSLLGSYSDIVF 



Sequence description 



A] Length: 423 bp - 141 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. Possesses a 
leader peptide 



ID-161 



Clone 2-cI3 



ATGAAAGGTCTATTGGATTTTTTAGTTAATATTGCCAGAACGCCAGCTATT 

TTAGTCGCCTTGATAGCCATTATCGGTTTAGTACTGCAGAAAAAAGGTGTT 

CCTGATATTGTAAAAGGTGGAATAAAAACATTTGTTGGCTTCTTAGTGGTT 

TCTGAAGGTGCAGGGATAGTCCAAAATTCCTTGAATCCATTTGGAAAAATG 

TTTGAACATGCTTTTCATTTGGTGGGGGTAGTTCCTAATAATGAAGCCATT 

GTAGCAGTAGCTCTTACGAAGTATGGCTCAGCAACTGCTTTGATTATGTTA 

GCGGGAATGATTTTTAATATTTTAATTGCTCGTTTTACAAAA 
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MKGLLDFLVNIARTPAILVALIAIIGLVLQKKGVPDIVKGGIKTFVGFLVVSEG 
AGIVQNSLNPFGKMFEHAFHLVGVVPNNEAIVAVALTKYGSATALIMLAGMI 
FNILIARFTK 



Sequence description 



A] Length: 348 bp - 1 16 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
Shine-Dalgamo sequence. Possible leader 
peptide 

ID-162 



Clone 2-c21 



TTGGTTGGTAAGCCCCAATTACTATTTTTAGATGAACCTACTTCCGGAATG 

GATACTTCCACACGTCAACGATTTTGGAAGCTGGTTGCGACACTAAAAAA 

AGAAGGTGACACAATTGTCTATTCTAGTCATTATATCGAAGAGGTAGAAC 

ATACAGCTGATAGGATTTTAGTACTTCATAAAGGAAAGTTATTACGCGATA 

CAACCCCCTTTGCCATGAAGCAAGAAAAAACCGAAAAGTTATTCACCGTT 

CC G CTT AGTTATC AAAAATT ATTAC CT ACCTATTTG ATTAC AG A GTGTG AA 

GCCAAGAGTGATAGTATAACGTTTGTTACTGGGGAGGCTGAAACTGTATG 

GAAAATACTGGCAGATAATGGTTGTCCTATTGAAGCTATTGAGATGACCA 

ATAGAACTTTGTTAAATCGTATTTTTGAGACTACTAAGGAGGTAAAACATG 
AGAATCTTTA 

MVGKPQLLFLDEPTSGMDTSTRQRFWKLVATLKKEGDTIVYSSHYIEEVEHTA 

DRJLVLHKGKLLRDTTPFAMKQEKTEKLFTVPLSYQKLLPTYLITECEAKSDSI 

TFVTGEAETVWKILADNGCPIEAIEMTNRTLLNRIFETTKEVKHENL 



Sequence description 



A] Length: 462 bp - 1 55 aa (partial sequence) 

B] B] Putative TTG start codon is not preceded by 
an obvious Shine-Dalgamo sequence. No obvious 
leader peptide. N- and C- termini require further 
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examination. 

ID-163 



Clone 2-c25 



TTGAAAAAATCCAAGAGAAGCCGTAAGGCAGTGACAACAAGTGGTGAGA 

AGACTTTACTTGAGGATTTGGCAAAAATGAATTTCCTAGACGAAGTCATTA 

ATGTTATGGTTTTATATACCTTGAATAAGACAAAATCTGCTAACTTAAATA 

AGGCCTATATCATGAAAGTTGCTAATGA11 TI GCCTTTCAGAATGTTATGA 

CGGCCGAAGATGCTGTGCTTAAAATTCGTGATTTTTCAGATCAAAAAGTAA 

GGACTAAAACAGAAACGAAGAAGAAACAATCGAATGTTCCTGAATGGAGT 

AATCCTGATTATAAAGATGAGGTTAGCCCAGAAAAAGAAATTGAATTAGA 

ACAGTTT 

MKKSKRSRKAVTTSGEKTLLEDLAKMNFLDEVINVMVLYTLNKTKSANLNK 
AY1MKVANDFAFQNVMTAEDAVLKIRDFSDQKVRTKTETKKKQSNVPEWSN 
PDYKDEVSPEKEIELEQF 



Sequence description 



A] Length:360 bp - 120 aa (partial sequence) 

B] N- and C- termini require verification. 



ID- 164 



Clone 2-c28 



ATGACGAATCATATTACTAAACTGATAGAAAATAGCGGAAAAAAATTGAC 

AGAAATTAGCGAAGCTACAGATATAGCCTATCCTACACTTTCTGGATACAA 

TCAAGGAATCCGCAAACCTAAAAAAGATAATGCTGAAAAATTGGCAAAAT 

ACTTTAATGTTTCCGTCGCTTACATTATGGGACTTGATAGCAACCCACATG 

CTCCATCAAATCTT 

MTNHITK1IENSGKKLTEISEATDIAYPTLSGYNQGIRKPKKDNAEKLAKYFNV 
SVAYIMGLDSNPHAPSNL 
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Sequence description 

A] Length:218 bp - 72 aa (partial sequence) 

B] ATG start codon is preceded by an 
obvious Shine Dalgamo sequence. No obvious 
leader peptide. 



ID- 165 



Clone 2-c29 



TTGATGAAAAGGAATAAACATTTACCGTTAACAGAAACTACCTATTATATT 

TTATTAGCTTTGTTTGAGGAAGCGCATGGCTATGCTATTATGAAAAAAGTT 

GAAGAAATGAGTGGCGGTGATGTTAGAATAGCCGCAGGGACAATGTACGG 

TGCCATTGAAAATTTACTTAAACAAAAATGGATAAAGTCTATCTCAAGTGA 

CGATAGAAGAAGAAAAGTTTATATTATTACTGAGACAGGAAAAGAAATAG 

TAGAACTTGAAACGAATCGATTAAGAAAGTTACTTAATACTGCTAATCAGT 

TGGGTTTTGGAGGAGATGGTTATGATAAAGTTT 

MMKRNKHLPLTETTYYILLALFEEAHGYAIMKKVEEMSGGDVRIAAGTMYG 
AIENLLKQKWIKSISSDDRRRKVYIITETGKE1VELETNRLRKLLNTANQLGFG 
GDGYDKV 



Sequence description 

A] Length:337 bp - 1 12 aa (partial sequence) 

B] TTG start codon is preceded by an 
obvious Shine Dalgamo sequence. Actual start 
codon may ATG that comes immediately after the 
TTG. Potential leader peptide. 



ID- 166 
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Clone 2-c35 



CCCATTACTGGTGAGTTAATAGCTGAGAAATTAGGAGTACCAAGAGCAGC 

ACTAAGGTCTGATTT GCGGG TTTTAAGTATGCTAGGTATCATAGATGCAAA 

ACCTAAGGTTGGTTATTTTTATTTAGGACAGTATCATGCTTCAATAGGGAC 

AAGTCATTTTGAAAAGATGACAGTTTCAGAAATTATGGGGATCCTTCTGAC 

AGTTC ATC AAAAAG ATTC AGTTTATG ATGTTATTGTAC ATA I 111 I ATGG A 

AGATGCTGGTTGTGCTTTTATCTTGGATGATGATGATTTTCTCTGTGGAGTC 

GTGTCACGTAAAGATTTACTAAAAACCAGTATTGGCGGAGGAGATCTTTCT 

AAAATGCCAATAGGAATGGTGATGACACGTATGCCACACGTGACAACTGT 

TTTAGAAAATGAAAGTCTTTTTGCGGCAGCTGATAAATTAGTGAGCAGAA 

AAGTGGATAGTCTCCCTGTCGTTCGTCATGATAAGCAATATCCCGAAAAAT 

TTA 

PITGELIAEKLGVPRAALRSDLRVLSMLGIIDAKPKVGYFYLGQYHASIGTSHF 
EKMTVSEIMGILLTVHQKDSVYDVIVHIFMEDAGCAFILDDDDFLCGVVSRKD 
LLKTSIGGGDLSKMPIGMVMTRMPHVTTVLENESLFAAADKLVSRKVDSLPV 
VRHDKQYPEKF 



Sequence description 



A] Length:51 1 bp - 1 70 aa (partial sequence) 

B] N- and C-termini to be determined 



ID- 167 



Clone 2-44 



TTGGAAGTCATCATGCAATTTATTTATAGTATTATTGGTATTTTATTGGTAT 
TAGGAATTGTGTATGCAATTTCTTTCAATCGTAAGAGTGTTTCTCTAAGTTT 
AATTG G A AAAG CTCTTATCGTTC AATTC ATTATTGCG CTAATCTTAGTACGT 
ATCCCACTAGGCCAACAAGTTGTTAGTGTTGTTTCAACTGGAGTTACTAAA 
GT AATC AACTGTGGTC AAG CTGGTTT 

MEVIMQFIYSIIGILLVLGIVYAISFNRKSVSLSLIGKALIVQFIIALILVRIPLGQQ 
WSVVSTGVTKVINCGQAG 
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Sequence description 

A] Length:233 bp - 77 aa (partial sequence) 

B] TTG start codon is preceded by a 
possible Shine Dalgamo sequence. Actual start 
codon may occur further downstream. Potential 
leader peptide. 

ID-168 



Clone 2-46 



CAACCTAATAAAGCTTTAGAAAGTGATGAGATTGATATTAATGCTTTCCAG 

CATTATAATTACTTAACCAATTGGAATAAAGCAAATAAGACCAATCTTGTT 

TCCGTTGCTGAGACATACTTTACTTCCTTTAGATTATACTCTGGTACTAAGA 

ACGGTAAAGGTAAATACCAAACAGTTTCTGAAATTCCAAATAAAGCAACT 

ATTACTATCCCAAACGATGCAGTTAACGAAAGTCGCTCTCTCTACTTGTTA 

CAATCAGCAGGCTTGCTAAAATTGAAAGTATCAGGTGATACATTAGCAAC 

AATGTCAGATGTTGTTTCCAATCCTAAATCTTTAGATTT 

QPNKALESDEIDINAFQHYNYLTNWNKANKTNLVSVAETYFTSFRLYSGTKN 
GKGKYQTVSEEPNKATITIPNDAVNESRSLYLLQSAGLLKLKVSGDTLATMSD 

WSNPKSLD 



Sequence description 



A] Length:344 bp - 1 14 aa (partial sequence) 

B] N- and C- termini require verification 



ID-169 



Clone 2-47 



ATGAAATGTATAATAAATAATATAAATAAAATAAAAATGATAATTGAGAT 
TTATCATAGAAGGAAAACTATTTTGAAATTAAATAAAATCATATTATCTAC 
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TGCAGCTCTTACTGCTCTC! 1 I'll AGG ATATA ATAGCGTTACTGCGG ATACA 
TATAATAACTATCAGCCACATAGATCAAATAATATGGATTTAACTGAGGA 
ATATAACTATAATAACCAGATAGAACTTCAGGAGCGTATAAAAAACCTAA 
AT AT AC CTTTT 

MKCIINNINKIKMIIEIYHRRKTILKLNKIILSTAALTALFLGYNSVTADTYNNY 
QPHRSNNMDLTEEYNYNNQIELQERJKNLNIPF 



Sequence description 

A] Length:264 bp - 88 aa (partial sequence) 

B] There is a Shine-Dalgarno sequence upstream 
of this sequence. Potential leader peptide 
sequence 



ID- 169 



Clone 2-47 



ATGAAATGTATAATAAATAATATAAATAAAATAAAAATGATAATTGAGAT 

TTATCATAGAAGGAAAACTATTTTGAAATTAAATAAAATCATATTATCTAC 

TGCAGCTCTTACTGCTCTC 1111 TAG G ATATAATA GCGTT ACTGCG GAT AC A 

TATAATAACTATCAGCCACATAGATCAAATAATATGGATTTAACTGAGGA 

ATATAACTATAATAACCAGATAGAACTTCAGGAGCGTATAAAAAACCTAA 

ATATACCTTTT 

MKCIINNINKIKMIIEIYHRJRXTILKLNKIILSTAALTALFLGYNSVTADTYNNY 
QPHRSNNMDLTEEYNYNNQIELQERJKNLNIPF 



Sequence description 

A] Length: 264 bp - 88 aa (partial sequence) 

B] There is a Shine-Dalgarno sequence upstream 
of this sequence. Potential leader peptide 
sequence 
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ID- 170 



Clone RS-58b 



TTGG GTG ATT ATTATGGTAAG AA AT ATTTTGGTG AG G C AG CT A AAAAAG A 
CGTCGAACATATGGCTAAGAAAATCATTAATGTCTATAAAACACGGTTAA 

AAAACAACACTTGGTTATC 

AGAAAATACAAAAGCAATGGCCATTAAGAAACTTGATAACATGAGATTAA 

TGATTGGCTATCCAGAAGATTATCCTGATCTTTATCGTCAGTACCAATTTG 

ATAGTAAAGCAAGCTTCTTTGAAAACAATGATAACTACAGAAAATTATCG 

AACAAGAAAACATTTGAAGAATTTAACCAGTCTAATCAACGTGAACATTG 

GCAAATGAGTGCCAATGCTGTAAATGCTTATAATGATCCTAATACCAATTC 

CATAGTCTTTCCAGCAGCGATTTTTCAATCACCACTGTACGATAAAACTAA 

AACAGTTAGTCAAAATTATGGAGCTATCGGAGCAATTATTGGTCATGAAAT 

TTCACACTCATTTGATATTAATGGTATGAAATATGACGAGAAAGGGAATCT 

TCACGATTGGTGGACTAAAGAAGATTTAAATCATTATAAGAAATCAACAC 

AAGCTATGATTGACCAATGGGATGGCCTTAAAGCAGATGGCGGTAAAGTT 

G ATGGTA AATTAACTTT AGC A G AAAATATTG C AG AT AATG GTG GTGTTATG 

GCATCTCTAGAAGCTCTTAAGACTGAAAAAATCCAAACTATAAAGAATTTT 

TTGAATCATGGGCAAGTATTTGGCGTCAAAAAGCAACCAAAGAACAAAGT 

AAGTCCTCAATTCAGTCAGATGTTCATGCACCATATGAATTGA > 

GAGCTAACATCCCAGTACGTAATTTCCAAGAATTTTATGATGCCTTTGGTG 

TTAAAAAAGGCGATTCAATGTATCTAAAACCAGAAAAACGTTTGACACTTT 

GGTAA 



MGDYYGKXYFGEAAKKDVEHMAKKIINVYKTRLKNNTWLSENTKAMAIKK 

LDNMRLMIGYPDYPDLYRQYQFDSKASFFENNDNYRKLSNKKTFEEFNQSNQ 

REHWQMSANAVNAYNDPNTNSIVFPAAIFQSPLYDKTKTVSQNYGAIGAIIGH 

EISHSFDINGMKYDEKGNLHDWWTKEDLNHYKKSTQAMIDQWDGLKADGG 

KVDGKLTLAENIADNGGVMASLEALKTEKIQTIKNFLNHGQVFGVKKQPKNK 

VSPQFSQMFMHHMN * 

Sequence description: 

A] Length: 819 bp - 272 aa (full length gene) 
(107 bp of additional DNA sequence (> onwards) is 
also included. While not in-frame with the 
described orf, it also shares strong homology 
with the neutral peptidases. 
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B] This gene sequence was not identified using the LEEP system. It was identified 
downstream of the ID-89 gene which was identified by LEEP, during cloning and 
sequence analysis of the full-length ID-89 gene sequence. ID-89 and ID- 170 together 
show homology over their combined entire length with the neutral endopeptidases 
from Lactococcus and Lactobacillus. Possesses TTG (possible ATG start codon 
located 13 bp further downstream) start codon with no obvious signal peptide. Shine 
Dalgamo sequence not immediately obvious. Possibly located further downstream 



ID-171 



Clone 2-18/22b (Mod2) 



ATGACCATGATTACGCCAAGCTTCATTAAGGTATCTCTAGATGAAACAAAT 

CGTATGATGCGTATGATATCAGATTTATTAAGTTTATCGCGCATTGATAAT 

GAAGTAACGCATTTAGATGTTGAAATGACGAATTTTACAGCTTTCATGACC 

TCAATTTTGAATCGATTTGATCAGATTAGAAATCAAAAAACAGTCACAGG 

AAAAGTTTATGAAATTGTCAGAGATTATCCTCTTAAGTCAATTTGGGTGGA 

AATTGATACAGATAAGATGACTCAAGTGATTGATAACATTTTAAATAATGC 

AGTCAAGTATTCACCAGATGGTGGTAAGATTACAGTTAATCTACGCACAAC 

TAAAACGCAGATGATTTTATCAATATCAGACCAAGGCTTAGGTATTCCCAA 

AAAAGATTTACCTCTC A 11111 GATCG I'll 1 I ATCGTGTTG ATAAGGCG AG A 

AGTCGTCAACAGGGTGGGACTGGACTTGGTTTGTCAATTGCAAAAGAAAT 

TGTTAAGCAGCATAAGGGATTTATTTGGGCTAAGAGTGAGTATGGTAAAG 

GGTCTACTTTTACAATCGTCTTGCCTTATGATAAAGATGCTGTAACTTATGA 

AG AATGG G AGG ACGTTGAA G ATTAA 



N^MITPSFIKVSLDETNRMMRMISDLLSLSRIDNEVTHLDVEMTNFTAFMTSIL 
NRFDQIRNQKTVTGKVYEIVRDYPLKSIWVEIDTDKMTQVIDNILNNAVKYSP 
DGGKITVNLRTTKTQMILSISDQGLGIPKKDLPLIFDRFYRVDKARSRQQGGTG 
LGLSIAKEIVKQHKGFIWAKSEYGKGSTFTIVLPYDKDAVTYEEWEDVED* 



Sequence description: 

A] Length: 613 bp - 212 aa (full-length gene possibly) 

B] Possible Shine Dalgamo sequence present 
upstream of a ATG start codon. May not have yet 
determined the N- portion of this gene. No 
obvious signal peptide. 
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ID- 172 



Clone 2-54baltemate (107b) 



TTGAAAAAAATTATTACTTCTATTCTATTACTTAGTTGCATTTTTTTTATGC 

C AAC C ATCTCTG CTG AATCTTTT AATG CTTCCGCTAAAC ATGCCTT AG C AGT 

TGATTTAGATTCAGGAAAAATCTTGTATGAAAAAGATGCTAACAAACCCG 

CTGCTATTGCTTCCTTGACTAAAATAATGACCGTTTATATGGTCTATAAAG 

AAATTGATAACGGTAACCTCAAGTGGAATACCAAAGTAAATATATCTGAC 

TACCCTTATCAACTAACACGCGAATCTGATGCTAGTAATGTTCCTTTAGAA 

AAAAGGCGCTATACTGTTAAACAACTCGTGGACGCTGCCATGATTTCTAGT 

GCTAACAGTGCAGCCATTGCTTTAGCTGAACATATTTCAGGAACTGAAAGT 

AAATTTGTTGATAAAATGACTGCTCAATTGGAAAAGTGGGGAATTCATGAT 

AGCCACCTAGTCAATGCTTCTGGCTTAAATAATAGTATGTTAGGCAATCAC 

ATTTATCCAAAATCGTCACAAAACGACGAAAATAAAATGAGTGCACGTGA 

TATTGCTATTGCTGCCTACCATTTGGTCAACGAATATCCTTCCATTCTTAAG 

ATTACTAGTAAGTCCGTTGCTAAATTTGATAAAGATATTATGCATTCTTAT 

AACTACATGCTACCAGATATGCCTGTCTTTAGACCAGGTATTACAGGTTTG 

AAAACTGGGACAACGGAATTAGCTGGCCAATCTTTTATTGCTACATCTACT 

GAAAGTGGAATGAGACTACTCACTGTTATTATGCATGCTGATAAGGCCGAT 

AAAGACAAATATGCTCGCTTTACAGCAACTAACTCTCTCTTGAACTATATC 

ACAAACACCTACGAACCTAACCTTGTATTAGCTAAAGGAGCTGCATATAA 

AGGTAAAGAAGCAAGTGTGAGAGACGGAAAAGAACAATCGGTCATCGCT 

GTTGCTAAAAACGATTTGAAAGTAGTACAGAAGAAAAATATCACTAAACA 

AAATCAGTTAAAAATTAACTTTAAAAAAGAGCTTACTGCTCCTATTACAAA 

AAAAGAGAACCTAGGGAAAGCTTATTACGTTGACCTTAATAAGGTTGGAA 

AAGGCTATCTCATAAAGGAACCTAGCGTTCATTTAGTGGCAAAAGATAGT 

ATTGAGCGCAGTTTCTTCCTCAAAGTGTGGTGGAATCATTTTGTGCGCTAC 

GTTAACGAAAAACTTTAA 



MKK1ITSILLLSCIFFMPTISAESFNASAKHALAVDLDSGKILYEKDANKPAAIA 

SLTK1MTVYMVYKEIDNGNLKWNTKVNISDYPYQLTRESDASNVPLEKRRYT 

VKQLVDAAMISSANSAAIALAEHISGTESKFVDKMTAQLEKWGIHDSHLVNA 

SGLNNSMLGNHIYPKSSQNDENKMSARDIAIAAYHLVNEYPSILKITSKSVAKF 

DKDIMHSYNYMLPDMPVFRPGITGLKTGTTELAGQSF1ATSTESGMRLLTVIM 

HADKADKDK YARFTATN S LLN YITNTYEPNL V LAKG AA YKGKE AS VRDGKE 

QSVIAVAKNDLKVVQKKNITKQNQLKJNFKKELTAPITKKENLGKAYYVDLN 

KVGKGYLIKEPSVHLVAKDS1ERSFFLKVWWNHFVRYVNEKL* 
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Sequence description: 

A] Length: 1236 bp - 412 aa (full-length gene sequence possibly) 

B] A possible Shine-Dalgarno sequence precedes the putative l TTG' start 
codon. (needs further cloning and sequencing to verify N-terminus) 



ID- 173 
Clone 3-60b 



ATGACGCTTCGAGAATTAACAATAGAAGAATTTAAAGAACATTCAGGAAA 

TTATGATTCACAATCATTTTTACAAAC ACCTGAGATGGCTAAAC IT IT AG A 

AAAACGCGGCTATGATGTTAGGTATTTGGGATATCAAGTAGAAAATAAAC 

TAGAGATAATCAGTTTATCTTATATTATGCCAGTCACTGGTGGTTTTCAAAT 

GAAAATTGATTCAGGACCAGTTCATTCAAATTCTAAGTATCTAAAACAATT 

TTATAAAGCATTGCAAGGCTATGCCAAATCCAACGGTGTTCTAGAATTAAT 

AGTTG AGCCTTTTG ATG ATTACC AATT ATTC A CTAGTTCGG G AGTTCCTAGT 

AATCAGGGAAATGATAATCTGATTGAAGATTTTACCAGTTCAGGTTATCAC 

CATGATGGTTTTAACAACTGGTTTTACTGGTAAATATTTATCTTGGCACTATG 

TTAAAAATTTAGAAGGTGTCACTTCTGAAACGTTACTATCTTCATTCTCTAA 

GACAGGACGAGCTTTGGTTAAGAAAGCAATGTCTTTTGGAATCAAGGTTC 

GCGTTCTTAAACGTGATGAGCTACATTTATTTAAAGAGATAACAACTTCTA 

CGTCAAATAGACGTGATTATATGGATAAGTCCTTAGATTATTATCAAGATT 

TTTACGATAGCTTTGAAGGCAAGGCTGAATTTGTGATTGCCACTTTAAATT 

TTAGAGAATACGACCATAACTTGCAAATAAAAGCTGAAGCATTGGAAAAT 

AAGCTT 



MTLRELTIEEFKEHSGNYDSQSFLQTPEMAKLLEKRGYDVRYLGYQVENKLEI 

ISLSYIMPVTGGFQMKIDSGPVHSNSKYLKQFYKALQGYAKSNGVLELIVEPF 

DDYQLFTSSGVPSNQGNDNLIEDFTSSGYHHDGLTTGFTGKYLSWHYVKNLE 

GVTSETLLSSFSKTGRALVKKAMSFGIKVRVLKRDELHLFKEITTSTSNPvKDY 

MDKSLDYYQDFYDSFEGKAEFVIATLNFREYDHNLQIKAEALENKL 



Sequence description 

A) Length: 771 bp - 257 aa (partial gene sequence) 

B) This gene sequence was not identified using the LEEP system. It was 
identified immediately downstream of the ID-65 gene which was identified by 
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LEEP, during cloning and sequence analysis of the full-length ID-65 gene 
sequence. Sequence Characteristics: 
No obvious leader peptide sequence 
Orf is preceded by a potential Shine- 
Dalgamo sequence. 



ID- 174 

Clone 2-17b(ID-80b) 



TTGTCATTAAGTTTGGTTGCAGTGTTAAATCTTATCCCTCCTAAAATCATGG 

GATCAGTTATTGATGCTATTACAACTGGAAAATTAACAAGACCACAATTAC 

TATGGAATTTATTAGGTTTGGTTTTGTCAGCTTTAGCTATGTATGGGCTGCG 

TTATATTTGGCGTATGTATATTTTAGGGACTTCTTACAAATTAGGCCAAGTT 

GTCAGATACCGTTTATTTGAAC ATTTTACAAAAATGTCTCCTTC 11111 ATC 

AGAAATATCGTACAGGTGATTTAATGGCGCACGCGACCAACGACATCAAT 

TCTCTAACACGTCTTGCAGGAGGAGGAGTTATGTCAGCAGTGGATGCCTCT 

ATCACAGCATTAGTAACGCTTATCACCATGTTCTTTACTATTTCGTGGCAA 

ATGACATTAATTGCGGTTATCCCTTTGCCCTTAATGGCCTTAGCACTAGTA 

AATTGGGGCGAAAAACCCATGAAACCTTCAAAGAATCTCAGGCAGCCCTT 

TTCAGAATTAAATAATAAAGTG 

MSLSLVAVLNLIPPKIMGSVIDAITTGKLTRPQLLWNLLGLVLSALAMYGLRYI 
WRMYILGTSYKLGQVVRYRLFEHFTKMSPSFYQKYRTGDLMAHATNDINSLT 
RLAGGGVMSAVDASITALVTLITMFFTISWQMTLIAVIPLPLMALALVNWGEK 
PMKPSKNLRQPFSELNNKV 



Sequence description 

A) Length: 534 bp - 1 78 aa (partial gene 
sequence) 

B) This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-80 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-80 gene sequence. 
Sequence Characteristics: 

No obvious leader peptide sequence 
Orf is preceded by a potential Shine- 
Dalgamo sequence. 
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ID- 175 



Clone 2-1 lAb (ID- 103b) 



ATGCATATTGAGACTGTTATTGATTTCAAAGAATTAGGAAAAAGATATCGT 

TTTAAAAATCCTACAAAAGAATTAATAGCTGATACTTTAGAACAAGTCTTA 

GAAGTGATAAAAGAAGTTGATTATTATCAATCTCAAAATTATTATGTTGTT 

GGTTATTTATCTTATG AAGCATCTGCTGC TT I "I G ATTC AC AT 11 1 AAAGTTT 

CTCAACAGAAGTTGGCTGGAGAACATCTAGCTTATTTTACAGTACATAAAG 

ATTGTG AG AACG AAG CTTTTC CTTTAAGTTATG AAAATGTT AG ATTAGC AG 

ATAATTGGACTGCTAATGTTTCTGAGCAAGAATATCAAGAGGCAATTGCTA 

ATATTAAAGGACAAATTAGACAAGGAAATACTTATCAAGTAAATTATACA 

CTAGAGCTTAGCCAACAATTATGCTCGGATCC 



MHIETVIDFKELGKRYRFKNPTKELIADTLEQVLEVIKEVDYYQSQNYYWGY 
LSYEASAAFDSHFKVSQQKLAGEHLAYFTVHKDCENEAFPLSYENVRLADNW 
TANVSEQE YQE AI AN IKG QIRQ GNTYQ VN YTLELSQQLC SD 



Sequence description: 

A] Length: 440 bp - 146 aa (partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID- 103 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID- 103 gene sequence. 
Shine Dalgarno sequence present upstream of 

ATG start codon, No apparent leader peptide sequence 

ID-176 



Clone 2-18/22b(b) (ID- 104b) 



GTGAATAATATGTTTTATCTCAAAATAGCCTGGCATAATTTAAAACATTCT 

ATAGACCAGTACATACCATTCCTCTTAGCCAGTTTATTACTTTATTCATTGA 

CTTGTTCTACGCTACTAATCTTAATGAGTGCTGTTGGAAGAGATATGGGGA 

CAGCGGCAACGGTTCTTTTTCTTGGAGTGATTGTTTTGTCAATCTTTGCGGT 

AGTCATGGAACATTATAGCTACAATATCTTGATGAAACAGCGTAGTAGTG 
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AATTTGGACTGTATAACATTTTGGGGATGAATAAACGTCAAGTTGCGCGTG 

TAGCTAGTCTAGAGCTGTTTATTATTTATATATTTCTTATTTCTATAGGAAG 

TCTGTTTAGTGCTTTTTTTGCTAAATTTATTTAT7TAATTTTTGTCAACATTA 

TTAACTATCATGCACTAAATCTTAGTTTAAGTTTATGGCCATTTATTATTTG 

TATCGTTATATTTAC AGGTA'l ' I I'll CTGACTTTAG AAGTTCCAGTTATTCG A 

CATGTTCATTTATCATCCCCATTAAGTCTTTTTAGAAAGAAACAACAGGGA 

GAAAAAGAACCAAAAGGTAATCTTATACTTGCAATTTTAGCGTTAGTAGCT 

ATCGCCATCGCTTATACAATGGCTCTTACTTCAGGTAAAGCACCTGCATTA 

GCTGTTATCTATCGTTTCTTCTTTGCAGTACTTTTAGTAATTGCTGGTACTT 

ATCTTTTTTATATTAGTTTTATGACATGGTACTTAAAAAGGTTGCGTCAAAA 

CAAGCATTATTATTATAAATCTGAGCATTTTGTATCAACTTCGCAAATGAT 

TTTTCGAATGAAGCAAAATGCAGTAGGGTTAGCAAGTATCACTTTATTAGC 

TGTTATGGCTCTAGTTACTATTGCTACAACAGTCTCACTCTATTCAAATACA 

CAAAATGTTGTTACCGGACTATTTCCAAAATCAGTAAGTTTATCAATAGAT 

AATTCAAAAGGTGACGCGAAAAATATATTTGAAGAAAAGATTTTGAAGAA 

ACTAGGTAAGTCATCTAAGGAAGCTATCACTTATAATCAGACAATGATTTC 

GATGCCAGTTAGTCAATCAAGTGACTTAATATCACATCTA 



MNNMFYLKJAWHNLKHSIDQYIPFLLASLLLYSLTCSTLLILMSAVGRDMGTA 

ATVLFLGVIVLSIFAVVMEHYSYNILMKQRSSEFGLYNILGMNKRQVARVASL 

ELFIIYIFLISIGSLFSAFFAKFIYLIFVNIINYHALNLSLSLWPFIICIVIFTGIFLTLE 

VPVIRHVHLSSPLSLFRKKQQGEKEPKGNLILA1LALVAIAIAYTMALTSGKAP 

ALAVIYRFFFAVLLVIAGTYLFY1SFMTWYLKRLRQNKHYYYKSEHFVSTSQM 

IFRMKQNAVGLASITLLAVMALVTIATTVSLYSNTQNWTGLFPKSVSLSIDNS 

KGDAKNIFEEKILKKLGKSSKEAITYNQTMISMPVSQSSDLISHL 



Sequence description: 

A] Length: 1 1 19 bp - 373 aa (partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID- 104 gene which was identified by LEEP, during 
cloning and sequence analysis of the full-length ID- 104 gene sequence. 
Possible Shine Dalgamo sequence present 

upstream of a GTG start codon. Possesses a potential 
leader peptide sequence 



ID- 177 
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Clone 2-5b (ID- 112b) 

ATGGTTG AG CC AATTATTTC AAT AC AAG G ACTTC ATAAAA GTTTTG GG AAA 

AATGAGGTTTTAAAAGGCATTGACTTGGATATTCATCAAGGAGAAGTGGT 

GGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACATTTTTAAGAACAAT 

GAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGATTG 

ATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGC 

ATGGTTTTTCAACAGTTCAATCTATTTCCCAATATGACTGTACTA<3AAAAT 

ATTACTTTATCACCTATTAAGACAAAGGGACTTTCTAAGCTTGATGCTCAG 

ACAAAAGCATACGAGCTACTTGAAAAAGTTGGACTCAAAGAGAAGGCTAA 

TGCTTATCCAGCAAGCTTATCTGGAGGACAACAACAACGGATTGCTATTGC 

AAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAACCTACTTCA 

GCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTA 

GCTAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCA 

CGTGAAGTAGCGGATCGTGTCATTtTTATGGATGCAGGGATTATTGTTGAG 

CAAGGGACCCCTAAGAAAGTATTTGAGCAGACAAAAGAAATCCGCACAAG 

AGACTTCTTAAGTAAAGTATTATAA 



MVEPIISIQGLHKSFGKNEVLKGIDLDIHQGEWVIIGPSGSGKSTFLRTMNLLE 

VPTKGTVTFEGIDITDKKNDIFKJV1REKMGMVFQQFNLFPNMTVLENITLSPIKT 

KGLSKLDAQTKAYELLEKVGLKEKANAYPASLSGGQQQRIAIARGLAMNPDV 

LLFDEPTSALDPEMVGEVLTVMQDLAKSGMTMVIVTHEMGFAREVADRVIF 

MDAGIIVEQGTPKKVFEQTKEIRTRDFLSKVL* 



Sequence description: 

A] Length: 735 bp - 244 aa (full length gene) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-1 12 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-1 12 gene sequence. 
Shine-Dalgamo sequence precedes the 'ATG* 

start codon. No obvious leader peptide 



ID- 178 

Clone 2-5c (ID-1 12c) 
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ATGTCTCAsTATCAAGAGTGGTTAGAAAACGACTCACTCGGTAAAGATATT 

AAGTC AGATTTAGAAGCTATTAAAGGAGATGAATCTGAAATTCAGGATCG 

TTTTTACAAAACATTAGAATTTGGAACGGCGGGATTGAGAGGTAAACTTG 

GAGCAGGAACCAATCGTATGAATACTTATATGGTGGGGAAAGCAGCACAA 

GCATTAGCTAATCGATTATTGATCATGGCCCTGAAGCTATTGCACGTGGAA 

TTGCAGTTAGTTATGATGTCCCGTTATCAATCTAAGGAATTTGCAGAATTA 

ACTTGGTCCATTATGGCAGCAAATGGTATTAAAGCCTTATATTTA 

MSHMNYKEIYQEWLENDSLGKDIKSDLEAIKGDESEIQDRFYKTLEFGTAGLR 
GKLGAGTNRMNTYMVGKAAQALANRLLIMALKLLHVELQLVMMSRYQSKE 
FAELTWSIMAANGIKALYL 



Sequence description: 

A] Length: 366 bp - 122 aa (partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-1 12 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-1 12 gene sequence. 
Shine-Dalgamo sequence preceded the 'ATG* 

start codon. No obvious potential leader peptide sequence. 

ID-1 79 

Clone 2-5d (ID-U2d) 

ATGCAACCTGTAAAAGTCGATGAACCTTCTGTTGAAGAAACCATTACTATT 

TTGAAAGGTATCCAAAAAAAATACGAAGATTATCATCACGTAAAATATAA 

TAATGATGCCATAGAAGCAGCTGCAGTACTATCTAATCGTTATATCCAAGA 

CCGCl 1 111 ACCTG ATAAAGC AATAGACTTATTAGATG AAGCTGGTTCTAA 

AATGAACCTAACACTAAATTTTGTTGATCCAAAAGAAATTGATCAACGTCT 

CATTGAAGCAGAAAATTTAAAAGCGCAAGCGACTCGTGAAGAAGATTACG 

AACGTGCAGCTTACTTCCGTGACCAGATTGCAAAATATAAAGAAATGCAG 

C AAC AAAAG GTCG ACG ATC AAG ATAC ACCTATTATTA C C G AAAAAAC AAT 

TGAGCACATCATTGAAGAAAAAACGAATATCCCTGTTGGTGATTTAAAAG 

AAAAAGAACAATCTCAATTAATTAATCTCGCAGATGACTTGAAACAGCAT 

GTGATCGGCCAGGATGACGCTGTCATTAAGATTGCAAAAGCTATTCGTCGT 

AATCGAGTTGGTCTTGGTAGCCCAAACCGTCCTATTGGTTCCTTTTTATTTG 

TAGGACCAACCGGTGTTGGTAAAACTGAACTTTCTAAACAACTAGCAATTG 

AGCTCTTTGGTTCAGCTGATAGTATGATTCGTTTTGATATGTCAGAGTACAT 

G G AAAAGC ATGCTGTTGCTAAATTAGTCGG AGCG CCTCCAGGATACGTGG 

GATACGAGGAAGCTGGACAACTAACTGAAAAGGTTCGTCGAAATCCTTAC 

TCGCTCATCCTTCTAGATGAAATTGAAAAAGCTCATCCCGATGTCATGCAT 



FIG. 1 CONTD 



82/110 



ATGTTCTTGCAGGTCCTTGATGACGGTCGATTAACAGATGGACAAGGAAG 
AACTGTTAGTTTTAAAGATACCATTATCATCATGACCTCAAATGCTGGTTC 
TGGTAAAACTGAAGCAAGTGTTGGCTTTGGTGCCTCACGAGAAGGTAGGA 
CGAATTCGAGCTCGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGGCAT 

GCAAGC 

MQPVKVDEPSVEETITILKGIQKKYEDYHHVKYNNDAIEAAAVLSNRYIQDRF 

LPDKAIDLLDEAGSKMNLTLNFVDPKEIDQRLIEAENLKAQATREEDYERAAY 

FRDQIAKYKEMQQQKVDDQDTPIITEKTIEHIIEEKTNIPVGDLKEKEQSQLINL 

ADDLKQHVIGQDDAVIKIAKAIRRNRVGLGSPNRPIGSFLFVGPTGVGKTELSK 

QLAIELFGSADSMIRFDMSEYMEKHAVAKLVGAPPGYVGYEEAGQLTEKVRR 

NPYSLILLDEIEKAHPDVMHMFLQVLDDGRLTDGQGRTVSFKDTIIIMTSNAGS 

GKTEASVGFGASREGRTNSSSVPGDPLESTCRHAS 



Sequence description: 

A] Length: 1070 bp y 356 aa (Partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID-1 12 gene which was identified by LEEP, during 
cloning and sequence analysis of the full-length ID-1 12 gene sequence. Shine- 
Dalgamo sequence preceded the 'ATG* 

start codon. No obvious potential leader peptide 
sequence. 

ID-1 80 



Clone 2-7b (ID- 113b) 



ATG AGAGGGAAGGTTATTTACGGCAC AACCCTTATAGGTC 1" I" 1" 1" 1 CTATTC 

TTATTTTTCTATTTTTGGATTCCTAAGCATCACATCGAGAGAATACATCATC 

ATCGTATAAAGCAGGTAGATGCGAAGAGTGATTTAACAGGATTTAAAACC 

CATTTGCCCATTATCAGCATTGATACAAAGCAACAAGTTATTCCTCTTGTT 

ACAAAAGAAGGCGGAAAATATGTCAAAGCTAGGGATAATATTAATGTTGA 

TATCGAATTACGGGATTCTCCAAGTAGATCACATCATTTATCAGAAAAGCC 

GAGAATTAGGACAAAAGGGTTAATATCATATAGAGGAAATTCCTCTCGTT 

ACTTTGATAAGAAGTCATTGAAAGTTAAGTTTGTTACTAATAAGTTAAAGG 

AAAAGAAGCATCGATTAGCAGGAATGCCTAAAGAATCGGAGTGGGTATTG 

CATGGTCCCTTTCTAGACAGAACATTATTAAGAAATTATCTGAGTTATAAT 
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ATTGCTGGTGAGATTATGCCTATGCCCCAAACGTTCGCTACTGTGAGTTAT 
TTGTCAATGGTGAGTATCAGGGAG 

MRGKVIYGTTL1GLFLFLFFYFWIPKHHIERIHHHRIKQVDAKSDLTGFKTHLPII 
SIDTKQQV1PLVTKEGGKYVKARDNTNVD1ELRDSPSRSHHLSEKPRIRTKGLIS 
YRGNSSRYFDKKSLKVKFVTNKLKEiCKHRLAGMPKESEWVLHGPFLDRTLLR 
NYLSYNIAGEIMPMPQTFATVSYLSMVS1RE 



Sequence description: 

A] Length: 582 bp - 194 aa (Partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-113 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID- 1 13 gene sequence. 
ATG start codon is preceded by a Shine- 

Dalgarno sequence-Possesses a potential leader peptide 
sequence. C-terminus to be determined. 



ID-181 



Clone 2-17b(ID-117b) 



CTTCACATTTTATTGATCACTATCTGACAAATGTTAATCAAACAGCAGTTCT 

TATTTTAGTGGGATATTATTCAATGTATGTCTTGCAGACCTTAATTCAATAT 

TTTGGGA ATCTC 1 1 111 I GCGCGTGTTTCTTATAGTATTGTTAGAGATATTC 

GTAGAGATGCTTTTGCTAATATGGAAAGGCTAGGCATGTCTTATTTTGATA 

GGACACCGGCAGGATCTATTGTGTCACGTATTACTAATGATACTGAAGCAA 

TATCTGATATGTTTTCGGGTATTTTATCAAGTTTTATCTCGGCGATATTTAT 

TTTTACAGTTACTCTGTACACTATGTTGATGCTAGACATTAAACTAACAGG 

ACTCGTCGCTCTTTTGTTACCTGTTATCTTTATATTAGTGAATGTCTATCGG 

AAAAAATCAGTCACTGTCATTGCTAAAACGAGAAGTTTACTTAGTGATATC 

AACAGTAAATTATCAGAAAGTATTGAAGGAATTC 

SHFIDHYLTNVNQTAVLlLVGYYSMYVLQTLIQYFGNLFFARVSYSrVRDIRRD 

AFANMERLGMSYFDRTPAGSIVSR1TNDTEAISDMFSG1LSSFISAIFIFTVTLYT 

MLMLDIKLTGLVALLLPVIFILVNVYRKKSVTVIAKTRSLLSDINSKLSESIEGI 
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Sequence description: 

A] Length: 498 bp - 165 aa (Partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-1 17 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-1 17 gene sequence. 
N- and C-termini have yet to be determined 



ID-1 82 



Clone 3-8b (ID-1 20b) 



ATGTACCATATTGAATTAAAAAAGGAAGCTTTACTACCAAGAGAACGCCT 

AGTTGATTTAGGCGCAGATAGATTGAGTAATCAGGAGTTATTAGCCATTCT 

CTTACGTACAGGTATTAAAGAAAAACCTGTTCTTGAAATTTCAACGCAAAT 

TTTAGAAAACATAAGCAGTTTAGCAGATTTTGGTCAATTATCCTTACAGGA 

GTTGCAATCCATTAAAGGAATCGGTCAGGTTAAATCCGTCGAAATAAAAG 

CTATGCTAGAACTAGCAAAACGGATTCACAAAGCTGAATATGATCGTAAA 

GAGCAAATTTTAAGTAGTGAACAATTAGCGAGGAAAATGATGCTCGAATT 

AGG G G ATAAAA AAC AAG A AC ATTTAGTAG CTATTTATATG GAT AC AC AAA 

ATCGTATTATCGAACAGAGAACTATTTTTATTGGTACTGTACGTCGTTCAG 

TAGCAGAGCCAAGAGAAATTCTACATTATGCTTGTAAAAACATGGCAACT 

TCTTTGATTATTATACATAATCATCCCTCAGGTTCTCCAAATCCCAGTGAAA 

GTGATTTAAGTTTCACTAAAAAAATAAAACGATCATGTGATCATC TGGGA A 

TTGTCTGCCTAGATCACATCATCGTTGGAAAAAATAAATATTATAGTTTTC 

GAGAAGAAGCAGATATTTTATAA 

MYHIELKKEALLPRERLVDLGADRLSNQELLAILLRTGIKEKPVLEISTQILENI 
SSLADFGQLSLQELQSIKGIGQVKSVEIKAMLELAKRIHKAEYDRKEQILSSEQ 
LARKMMLELGDKKQEHLVAIYMDTQNRIIEQRTIFIGTVRRSVAEPREILHYAC 
KNMATSLIIIHNHPSGSPNPSESDLSFTKKIKRSCDHLGIVCLDHirVGKNKYYSF 

REEADIL* 
Sequence description: 

A] Length: 681 bp - 227 aa (full-length gene) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-1 20 gene which was identified by LEEP, 
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during cloning and sequence analysis of the full-length ID- 120 gene sequence. 
ATG start codon is preceded by an typical 
Shine-Dalgamo sequence. No obvious leader 
peptide sequence 



ID-183 



Clone 3-llb(ID-121b) 



TGGTTAAAAGTAGTGATAGCTTGTATTCCATCTATTTTAATTGCTTTACCAT 

TTG ATA ATTG G TTT G AAG C TC ATTTTAATTTC ATG ATTCCG ATTG C AATA GC 

CCTAATCTTTTATGGTTTTGTCTTCATATGGGTTGAAAAACGTAATGCACAC 

CTCAAACCACAGGTAACCGAATTGGCAAGTATGTCTTACAAGACAGCTTTC 

TTGATTGGATGTTTCCAGGTTCTCAGTATTGTTCCGGGAACCAGTCGTTCTG 

GAGCTACTATTTTAGGAGCAATTATTATTGGAACTAGTCGTTCGGTCGCTG 

CTGACTTTACTTTCTT CCTTGC CATCCCAACTATGTTTGGTTATAGTGGACT 

TAAGGCG GTTAA ATATTTTTTAGATGGTAACGTCTTGAGTTTAGACCAATC 

TTTAATACTTTTAGTAGCAAGTCTGACAGCTTTCGTAGTTAGTTTATATGTT 

ATTCGTTTCTTGACAGACTATGTCAAACGACACGATTTCACCATCTTTGGT 

A AGT ATCGTATAGTCTTAGGAAGTTTACTCATCCTCTACTGGTTAGTTGTTC 

ATTTATTCTAA 



WLKVVIACIPSILIALPFDNWFEAHFNFMIPIAIALIFYGFVFIWVEKRNAHLKP 
QVTELASMSYKTAFLIGCFQVLS1VPGTSRSGATILGAIIIGTSRSVAADFTFFLA 
IPTMFGYSGLKAVKYFLDGNVLSLDQSLILLVASLTAFVVSLYVIRFLTDYVKR 
HDFTIFGKYRTVLGSLLILYWLVVHLF* 

Sequence description: 

A] Length: 579 bp - 193 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-68 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-68 gene sequence 
described in WO 00/06736. N-terminus has yet to be determined. 

ID- 184 



Clone 3-llc(ID-121c) 
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ATGGAAATGAAACAAATCAGTGAAACAACACTGAAAATTACAATTAGTAT 

GGAAGATTTAGAAGATCGTGGTATGGAGCTGAAAGATTTCCTAATCCCTCA 

GGAG AAG ACTG AGGA ATTTT1 CTATTCTGTC ATG G ATG AATTAG ACTTGCC 

AGAAAACTTTAAAAATAGTGGTATGTTAAGTTTTCGAGTAACACCTAAAA 

AAGATCGCATTGATGTTTTTGTTACAAAGTCTGAATTAAGTAAAGATTTAA 

ATTTAGAAGAATTAGCAGATTTGGGTGACATTTCAAAAATGTCTCCAGAAG 

ACTTTTTTAAAACCTTGGAACAATCGATGTTGGAAAAAGGGGATACGGAT 

GCCCATGCCAAATTAGCAGAAATTGAAAATATGATGGATAAAGCAACTCA 

AGAAGTAGTTGAGGAAAATGTTTCTGAAGAACAACCTGAAAAGGAAGTAG 

AAACGATTGGATATGTTCACTATGTCTTTGATTTTGATAATATTGAAGCTGT 

AGTTCGATTTTCACAAACGATTGATTTTCCAATAGAAGCTT 

MEMKQISETTLKITISMEDLEDRGMELKDFLIPQEKTEEFFYSVMDELDLPENF 
KNSGMLSFRVTPKKDRIDVFVTKSELSKDLNLEELADLGDISKMSPEDFFKTLE 
QSMLEKGDTDAHAKLAEIENMMDKATQEVVEENVSEEQPEKEVETIGYVHY 
VFDFDNIEAVVRFSQT1DFPIEA 



Sequence description: 

A] Length: 547 bp - 1 82 aa (Partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-68 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-68 gene sequence. 
ATG start codon is preceded by an typical 

Shine-Dalgamo sequence. No obvious potential 

leader peptide 

sequence 



ID- 185 



Clone 3- 16b (ID- 122b) 



GGAAACCAACGGCCAGTACAATCGTCAAGGGTAGATTATCCTAAACGTAG 

TCGTGCCAAGATTGTAGAAGTTTATTTTAGACAAGCTTCTACTACTGATTA 

TTCTGGTGTTTACAAAGGTTACTATATTGACTTTGAAGCCAAAGAAACCCG 

GCAGAAAACTGCTATGCCTATGAAAAATTTTCATGCTCACCAAATAGAGC 

ACATGGCAAATGTATTACAGCAAAAAGGGATTTGCTTTGTCTTGCTTCATT 
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TTTCCACACTTAAGGAAACCTATCTACTCCCTGCTAATGAGTTAATTTCATT 
TTATCAGATTGATAAAGGCAATAAATCAATGCCTATTGATTATATCAGAAA 
AAATGGATTTTTCGTAAAGGAGAGTGCCTTTCCTCAAGTCCCTTACTTAGA 
TATTATTGAAGAAAAATTATTAGGCGGTGATTACAATTAA 

GNQRPVQSSRVDYPKRSRAKIVEVYFRQASTTDYSGVYKGYYIDFEAKETRQ 
KTAMPMKNFHAHQIEHMANVLQQKGICFVLLHFSTLKETYLLPANELISFYQI 
DKGNKSMPIDYIRKNGFFVKESAFPQVPYLDIIEEKLLGGDYN* 



Sequence description: 

A] Length: 447 bp - 1 49 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID- 122 gene which was identified by LEEP, during 
cloning and sequence analysis of the full-length ID-122 gene sequence. N- 
terminus has yet to be determined 



ID-186 



Clone 3-17b (ID-123b) 



GGATCCTAAAAACGCTAAGGTTTATCAAAAAAATGCTGATCAATTTAGTG 

ACAAGGCAATGGCTATTGCAGAGAAGTATAAGCCAAAATTTAAAGCTGCA 

AAGTCTAAATACTTTGTGACTTCACATACAGCATTCTCATACTTAGCTAAG 

CGATACGGATTGACTCAGTTAGGTATTGCAGGTGTCTCAACCGAGCAAGA 

ACCTAGTGCTAAAAAATTAGCCGAAATTCAGGAGTTTGTGAAAACATATA 

AGGTTAAGACTA 1 II 1 1 GTTGAAGAAGGAGTCTC ACCTAAATTAGCTCAAG 

CAGTAGCTTCAGCTACTCGAGTTAAAATTGCAAGTTTAAGTCCTTTAGAAG 

CAGTTCCCAAAAACAATAAAGATTACTTAGAAAATTTGGAAACTAATCTTA 

AG GTACTTGTCAA ATCGTTAAATC A ATAG 



DPKNAKVYQKNADQFSDKAMA1AEKYKPKFKAAKSKYFVTSHTAFSYLAKR 

YGLTQLGIAGVSTEQEPSAKKLAE1QEFVKTYKVKTIFVEEGVSPKLAQAVAS 

ATRVKIASLSPLEAVPKNNKDYLENLETNLKVLVKSLNQ* 



Sequence description: 
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A] Length: 433 bp - 144 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID- 123 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID- 123 gene sequence. 
N-terminus has yet to be determined 



ID-187 



Clone 3-46/47 (ID-130b) 



ATGAAAAAAGTCATCGATTTAAAAAAACTACAAAAAGCATACGCCTCAGA 

AACTGTTTTAAATAATATTAATTTGGAGGTGTTTAAAGGAGAAATAATTGG 

ATTAATAGGACCCTCTGGAGCAGGGAAATCTACCTTGATTAAAACTATGCT 

TGGCATGGAAAAAGCAGATAAGGGAACAGCTCTTGTTCTTGATACTCAAA 

TGCCAGATCGTAATATTTTAAATCAAATTGGCTATATGGCTCAATCTGATG 

CCTTACACGAGTCTTTAACTGGCTTAGAAAATTTATTATTCTTTGGAAAAA 

TGAAAGGTATTCAAAAAACTGAATTAAAACAGCAGATAACTCATATTTCT 

AAAGTAGTAGATCTAGAAAACCAACTTGATAAATTTGTCTCAGGTTACTCA 

GAAGGTATGAAAAGACGGCTTTCTCTAGCCATCGCCCTACTTGGAAACCCC 

ACAGTTTTAATCCTAGATGAACCTACCGTTGGAATTGATCCATCCTTGAGG 

AGAAAAATCTGGCAAGAGCTAATTAATATTAAGGATGAAGGACGTTCTAT 

CTTTATTACAACCCACGTTATGGATGAAGCAGAATTAACAAGTAAGGTTGC 

ACTACTATTACGTGGAAACATTATTGCCTTTG ATACT CCATTACATTTAAA 

AAAACAATTTAATGTGAGTACTATTGAGGAAGTTTTCTTAAAAGCTGAAGG 

AGAATAA 

MKKVIDLKKLQKAYASETVLNNINLEVFKGEIIGLIGPSGAGKSTLIKTMLGME 

KADKGTAL VLDTQMPDRN I LN QIG YMAQSD ALHES LTGLEN LLFFGKMKGIQ 
KTELKQQITHISKVVDLENQLDKFVSGYSEGMKRRLSLAIALLGNPTVL1LDEP 
TVG1DPSLRRKIWQELINIKDEGRSIFITTHVMDEAELTSKVALLLRGNIIAFDTP 

LHLKKQFN VSTIEE V FLKAEGE * 



Sequence description: 

A] Length: 717 bp - 239 aa (Possible full-length sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID-130 gene which was identified by LEEP, during 
cloning and sequence analysis of the full-length ID-130 gene sequence. ATG 
start codon is preceded by a possible 
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Shine-Dalgarno. No obvious potential leader 
peptide sequence 



1D-188 



Clone 3-83b (ID-144b) 

ATGGTACAAATGATACATGATATGATTAAAACAATTGAGCATTTTGCTGAG 

AC AC AAGCTG A 1 TTI CC AGTGTATG ATATTTTAGGGGAAGTCC ATACTTAT 

GGACAACTTAAAGTAGACTCTGACTCTCTAGCTGCTCATATTGATAGCCTA 

GGCCTTGTTGAAAAATCACCTGTCTTAGTATTCGGTGGTCAAGAATATGAA 

ATGTTGGCGACATTTGTTGCTTTAACAAAGTCAGGGCATGCTTATATACCG 

GTTGACCAACACTCTGCTTTGGATAGAATACAGGCTATTATGACAGTTGCT 

CAACCAAGCCTTATCATTTCAATTGGTGAATTTC CTCTT GAAGTTGATAAT 

GTCCCAATCCTAGACGTTTCTCAAGTTTCAGCTATTTTTGAAGAAAAGACT 

CCTTATGAGGTAACACATTCTGTTAAAGGTGATGATAATTACTATATTATT 

TTCACTTCAGGGACTACTGGTTTACCAAAAGGTGTGCAAATTTCACATGAC 

AATTTATTGAGCTTTACAAATTGGATGATTTCTGATGATGAGTTTTCAGTTC 

CTGAAAGACCGCAAATGTTGGCTCAACCC 

MVQMIHDMIKTIEHFAETQADFPVYDILGEVHTYGQLKVDSDSLAAHIDSLGL 
VEKSPVLVFGGQEYEMLATFVALTKSGHAYIPVDQHSALDRIQAIMTVAQPSL 
IISIGEFPLEVDNVPILDVSQVSAIFEEKTPYEVTHSVKGDDNYYIIFTSGTTGLP 
KGVQISHDNLLSFTNWMISDDEFSVPERPQMLAQP 



Sequence description: 



A] Length: 592 bp - 197 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID- 144 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID- 144 gene sequence. 
Putative ATG start codon is preceded by a 

typical Shine-Dalgarno sequence. No obvious 

leader peptide sequence 

This orf is not in frame with nuc 



ID-189 
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Clone 3-86b (ID-145b) 



ATGGAAAATCATCGTTATGAAGATGAAGGTAAATTCCAGCGTAAGATGAC 

CAGTCGTCATCTCTTTATGTTATCGCTAGGTGGTGTTATCGGGACTGGGCTT 

TTCTTGAGTTCAGGTTATACCATTGCACAGGCTGGTCCGCTTGGAGCTGTG 

CTGTCTTATTTG ATTG GTGCCGTTGTG GTTT ATTTG GTC ATGCT ATC ACTTG 

GGGAATTGGCGGTTGCCATGCCGGTGACGGGGTCATTCCACACTTATGCCA 

CTAAGTTTATCAGTCCTGGAACAGGTTTTACTGTTGCTTGGCTATATTGGAT 

TTGTTGGACGGTCGCCTTGGGGACTGAAl'll 1 1 AGGTGCTGCCATGCTGAT 

GCAGCGCTGGTTCCCAAATGTGCCGGCTTGGGCATTTGCTTCCrill TTGCC 

CTTGTGAI'J l'J TGGTTTAAATGCTCTTAGCGTACGC11 Till GCAGAAGCAG 

AGTCTTTCTTCTCAAGTATTAAGGTTATTGCTATCATTATCTTTATTATCTTG 

GGCTTAGGTGCTATGTTTGGTCTAGTTTCCTTTGAAGGTCAGCACAAGGCT 

ATTCTCTTCACTCATCTGACTGCCAATGGTGCCTTTCCAAATGGTATCGTTG 

CAGTTGTCTCAGTCATGTTGGCTGTTAACTATGCCTTCTCTGGTACTGAGTT 

AATTGGTATTGCGGCTGGTGAAACGGATAATCCCAAAGAAGCTGTACCAA 

GGGCTATTAAAACGACAATCGGTCGCTTGGTTGTTTTCTTTGTACTGACAA 

TTGTTGTCCTAGCTTCGCTATTGCCAATGAAAGAGGCAGGCGTATCCACAG 

CACCATTCGTTGATGTCTTTGACAAGATGGGAATCCC 1 1 11 ACGGCGGATA 

TCATGAACTTCGTTATCTTGACAGCCATCCTGTCTGCTGGTAACTCAGGTCT 

CTACGCATCAAGCCGTATGCTCTGGTCCCTTGCCAATGAAGGTATGTTGTC 

AAAATCTGTTGTGAAAATCAATAAACACGGTGTCCCAATGCGTGCTCTTCT 

CTTGTCAATGGCAGGAGCAGTGCTGTCGCTCTTTTCAAGTATTTACGCTGC 

AGACACAGTTTATCTAGCCTTGGTTTCAATCGCGGGCTTTGCTGTTGTTGTC 

GTATGGCTAGCCATTCCAGTCGCACAAATCAATTTCCGCAAGGAATTC 

MENHRYEDEGKFQRKMTSRHLFMLSLGGVIGTGLFLSSGYTIAQAGPLGAVL 

SYLIGAVVVYLVMLSLGELAVAMPVTGSFHTYATKFISPGTGFTVAWLYWIC 

WTVALGTEFLGAAMLMQRWFPNVPAWAFASFFALVIFGLNALSVRFFAEAES 

FFSSIKVIAIIIFIILGLGAMFGLVSFEGQHKAILFTHLTANGAFPNGIVAVVSVM 

LAVNYAFSGTELIG1AAGETDNPKEAVPRAIKTT1GRLVVFFVLT1VVLASLLPM 

KEAGVSTAPFVDVFDKMGIPFTADIMNFVILTAILSAGNSGLYASSRMLWSLA 

NEGMLSKSVVKINKHGVPMRALLLSMAGAVLSLFSSIYAADTVYLALVSIAGF 

AVVVVWLAIPVAQINFRKEF 



Sequence description: 



A] Length: 1 126 bp - 393 aa (partial gene 
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sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-145 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-145 gene sequence. 
Putative ATG start codon is preceded by a 
typical Shine-DaJ garno sequence. Possesses a 
possible leader peptide sequence. 



ID- 190 



Clone 3-94b 



TCAGAAAATGCAGAGGCAGCAACGGTTGCCACAAACTTGGTTACCAAAGG 

AGCTAATGTCATTATCGGACCAGCAACATCGGGTGCAGCTGCATCTTCAAC 

TCCAAAAGTAAATGCAGCAGCAGTTCCAATGATTGCACCTGCTGCGACAC 

AAGACAATTTAGTCTATGGTTCTGATGGAAAAACCTTAAATCAGTATTTCT 

TCCGAGCTACTTTTGTCGATAATTATCAAGGAAAGCTATTGTCTCAGTATG 

CTACAGACAACCTTAAAGCTAAAAAAGTTGTTCTATTTTATGATAATTCAT 

CAGATTACTCAAAGGGGGTAGCAAAATCATTTAAGGAAAGTTATAGTGGA 

AAAATTGTTGATAGTATGACATTCTCCGCTGGTGATACTGATTTCCAAGCG 

TCATTGACTAAGTTGAAAGGGAAAGAATATGATGCTATTGTGATGCCAGG 

TTACTATACCGAGACAGGATTAATAGTTAAGCAAGCGCGTGATTTAGGTAT 

CTCTAAACCGGTTCTTGGGCCTGATGGTTTTGATAGTCCGAAATTTGTGCA 

ATCGGCAACACCTGTAGGAGCTTCAAACGTTTATTATTTGACAGGTTTCAC 

TACACAAGGATCAACCAAAGCTAAAGCT 

SENAEAATVATNLVTKGANVIIGPATSGAAASSTPKVNAAAVPMIAPAATQD 
NLVYGSDGKTLNQYFFRATFVDNYQGKLLSQYATDNLKAKKVVLFYDNSSD 
YSKGVAKSFKESYSGKIVDSMTFSAGDTDFQASLTKLKGKEYDAIVMPGYYT 
ETGLIVKQARDLGISKPVLGPDGFDSPKFVQSATPVGASNVYYLTGFTTQGST 
KAKA 



Sequence description 



A] Length: 637 bp - 23 1 aa (partial sequence) 
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B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID- 149 gene which was identified by LEEP, during 
cloning and sequence analysis of the full-length ID- 149 gene sequence. N- and 
C-termini have yet to be determined 



ID-191 



Clone 2-c94b (ID- 153b) 



TTGGGACTTAAAGACCATGCTTTAGTCTATCCATTTTCATTATCTGGGGGG 

CAAAAGCAACGTGTCGCACTAGCTCGTGCGATGATGATTGATCCACAGATT 

ATTGGTTATGATG AGCC AACTAGCGCTCTTGATCCAGAGTTGCGTCAAGAA 

GTAGAAAAACTAATTTTACAAAATAGAGAAACAGGTATGACACAAATTGT 

AGTAACACATGATCTTCAATTTGCTGAAAGTATATCTGATACGATTCTCAA 

AATTAATCCTAAGTAG 

MGLKDHALVYPFSLSGGQKQRVALARAMMIDPQIIGYDEPTSALDPELRQEV 
EKLILQNRETGMTQI V VTHDLQF AE SI SDTI LKINPK * 



Sequence description 

A] Length: 270 bp - 90 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID- 153 gene which was identified by LEEP, during 
cloning and sequence analysis of the ID-153 gene sequence. 

N-terminus has yet to be determined 



ID-192 

Clone 2-clb(ID-155b) 



ATGACTAATATCTCAGATGTTCCAAAAGCTATTAGAACACAGGCACAGTAT 
GTTCTCTTGGGAATGAGAGTTATGGATCAGTCGGTATTACCGAAAACATAT 
AATTCAAAAGAACCTTATTTGAAACCAGATATGATTTATATTCATGATAGA 
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AGACAAGAGACAATGCTTAAAATCACTCAAGAAATAGAAATGGAGCATTG 
A 

MTNISDVPKAIRTQAQYVLLGMRVMDQSVLPKTYNSKEPYLKPDMIYIHDRR 
QETMLKITQEIEMEH * 



Sequence description 



A] Length: 204 bp - 68 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the JD-I55 gene which was identified by LEEP, during 
cloning and sequence analysis of the ID- 155 gene sequence. 

ATG start codon is preceded by a potential typical Shine-Dalgarno sequence. 
Has a 

typical leader peptide. N-terminus has yet to be 
verified 



ID- 193 

Clone 2-54altb (ID- 172b) 

AAGCTTGCATGCCTGCAGGTCGACTCTAGAGGATCTTGGGGAATATAAATT 

TGGATTTCATGACGATGTAAAGCCAATTTATTCTACGGGAAAAGGTCTAAA 

TGAGGCTGTTATTCGTGAGTTATCTGCAGCTAAGGGTGAACCTGAGTGGAT 

GTTGGACTTTCGTCTAAAATCCTTGGAAACGTTTAATAAAATGCCGATGCA 

GACCTGGGGAGCAGATTTATCAGATATTGATTTTGATGATATTATTTATTA 

TCAAAAAGCATCTGATAAACCTGCGCGTGATTGGGATGATGTTCCAGAAA 

AAATCAAAGAAACTTTTGAAAGAATTGGGATTCCAGAAGCTGAAAGAGCC 

TATCTTGCAGGAGCATCAGCACAATATGAATCAGAAGTAGTTTATCACAAT 

ATGAAAGAAGAATATG ATAAGCTGGGTATTG 1 T 1 1 1 ACGGATACTG ACTCT 

GCACTTAAAGAGTACCCAGAGCTATTCAAAAAATATTTTGCTAAACTTGTC 

CCTCCAACAGATAATAAATTAGCTGCTCTGAACTCTGCTGTATGGTCAGGT 

GGAACATTTATTTATGTTCCTAAAGGTGTTAAGGTGGATATTCCACTTCAA 

ACTTACTTCCGTATTAATAATGAAAATACTGGACAATTTGAACGTACTCTC 

ATTATTGTTGATGAGGGAGCAAGTGTTCACTATGTTGAAGGTTGTACCGCC 

CCAACTTATTCTTCAAATAGTTTACATGCAGCTATAGTTGAAATTTTTGCAC 

TTGATGGAGCTTATATGCGCTATACGACTATTCAAAATTGGTCCGATAATG 

TCT ATAATTTAGTG AC AAAAC GTG CTACCG CTAAAAAAG ATG C AAC AGTT 

GAGTGGATAGATGGAAATCTAGGAGCTAAAACAACAATGAAATACCCATC 
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GGTTTACCTTGATGGTGAAGGAGCACGTGGCACGATGTTGTCTATTGCTTT 

TGCAAACAAAGGACAACACCAAGATACGGGTGCAAAGATGATTCATAATG 

CCCCCCATACTAGTTCATCCATTGTCTCTAAATCAATTGCTAAGGGTGGGG 

GAAAAGTTGATTATCGAGGTCAAGTGACATTTAATAAAGATTCCAAAAAA 

TCAGTGTCACATATAGAATGTGACACCATATTGATGGATGATATTTCAAAA 

TCAGATACCATACCGTTTAATGAGATTCATAATTCACAGGTTGCTTTAGAG 

CATGAAGCAAAGGTGTCTAAGATTTCTGAAGAGCAACTGTACTACTTGATG 

AGTCGAGGTTTATCTGAAGCTGAAGCAACAGAAATGATTGTTATGGGGTTT 

GTTGAGCCCTTTACGAAAGAATTACCAATGGAATATGCGGTAGAGTTAAA 

TCGTTTAATTTCCTATGAAATGGAAGGTTCAGTTGGTTAA 



MHACRSTLEDLGEYKFGFHDDVKPIYSTGKGLNEAVIRELSAAKGEPEWMLD 

FRLKSLETFNKMPMQTWGADLSDIDFDD1IYYQKASDKPARDWDDVPEKIKE 

TFERIGIPEAERAYLAGASAQYESEWYHNMKEEYDKLGIVFTDTDSALKEYP 

ELFKKYFAKLVPPTDNKLAALNSAVWSGGTFIYVPKGVKVDIPLQTYFRINNE 

NTGQFERTLIIVDEGASVHYVEGCTAPTYSSNSLHAAIVEIFALDGAYMRYTn 

QNWSDNVYNLVTKRATAKKDATVEWIDGNLGAKTTMKYPSVYLDGEGARG 

TMLSIAFANKGQHQDTGAKMIHNAPHTSSSIVSKSIAKGGGKVDYRGQVTFN 

KDSKKSVSHIECDTILMDDISKSDTTPFNEIHNSQVALEHEAKVSKISEEQLYYL 

MSRGLSEAEATEMIVMGFVEPFTKELPMEYAVELNRJL1SYEMEGSVG* 



Sequence description: 

A] Length: 1411 bp - 469 aa (Possible full-length gene) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID- 72 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-72 gene sequence. 
No obvious Shine Dalgarno sequence upstream of 

TTG start codon insufficient sequence data). N 
terminus needs verification. 



ID- 194 

Clone3-lb(ID-8Ib) 

ATGATAGAATTCTTTTCTAATATCAGAACAGAGATTCCGCAGATGCCTTTA 
CTTATCCATAGTTTGATTTTATCTGTCTTACCTTTTCTGATGTGG CTGAC TTT 
GGTTAATAGAGATAAGCCTTTGTATAAAACTATTTGGAGTATCCTTTTAGG 
ACTTCAGTTAATTACGATTTATACTTGGTTTTTCTGGGCAAAATTGCCTTTA 
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TCTG AAAGTCTTCCCCTTT ACC ATTGTC G AATAG G C ATGTTTGTCG GTCTCT 
TA 



MIEFFSNIRTEIPQMPLLIHSLILSVLPFLMWLTLVNRDKPLYKTIWSILLGLQLI 
TIYTWFFWAKLPLSESLPLYHCPvIGMFVGLL 



Sequence description 

A) Length: 261 bp - 87 aa (partial gene sequence) 

B) This gene sequence was not identified using the LEEP system. It was identified 
downstream of the ID-81 gene which was identified by LEEP, during cloning and 
sequence analysis of the full-length ID-81 gene sequence. Sequence Characteristics: 
Possesses a potential leader peptide sequenceOrf is preceded by a potential Shine- 
Dalgamo sequence. 



ID- 195 



Clone RS-55b 



AAGCTTGTGCAAAGTATTAAAGAGATAGGATTAGCTAATGCGCATTTATTA 

GCTGTTGCTCCGACAGGGTCAATCAGTTATCTTTCTTCTTGTACTCCGAGCC 

TTCAACCGGTTGTATCACCTGTCGAAGTACGCAAGGAAGGAGCACTGGGG 

AGGGTTTATGTAGCTGCTTATAAGATTGATGCAGATAATTATGTCTACTAC 

AAAAAAGGAGCTTATGAAGTGGGATCTGAGGCGATTATCAATATTGCAGC 

TGCCGCTCAAAAACACATTGATCAAGCTATTTCGTTAACGCTTTTCATGAC 

AGATCAAGCAACTACGCGAGATTTAAATAAAGCCTATATTCAAGCATTTA 

AACAAAAATGTGCCTCTATTTATTATGTACGAGTGAGACAGGACATCCTAG 

AAGGTAGCGAGAGTTATGATGATATGCTGGATGATTTCACTTCATCGGACT 

TAGAAGACTGTCAATCCTGCATGATTTAA 



>KLVQSIKEIGLANAHLLAVAPTGSISYLSSCTPSLQPWSPVEVRKEGALGRV 
YVAAYKIDADNYVYYKKGAYEVGSEAIIN1AAAAQKHIDQAISLTLFMTDQAT 
TRDLNKAYIQAFKQK.CASIYYVRVRQDILEGSESYDDMLDDFTSSDLEDCQSC 
MI* 



Sequence description: 
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A] Length 486 bp - 162 aa (Partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was identified 
upstream of the ID-87 gene which was identified by LEEP, during cloning and 
sequence analysis of the full-length ID-87 gene sequence. N-terminus to be 
determined. 



ID-196 

Clone RS-59(ID-90b) 



GTGAGGACATATATTACAAACTTGAATGGACATTCAATCACTAGTACAGC 

ACAAATAGCTCAAAACATGGTAACAGATATAGCAGTAAGCTTAGGTTTTC 

GTGAGCTGGGAATACATTCTTATCCGATTGATACTGATTCTCCTGAGGAAA 

TGAGTAAGCGTTTAGATGGAATCTGTTCCGGACT TAGA AAAAATGATATTG 

TCATATTTCAGACACCTACATGGAACACTACAACTTTTGATGAAAAATTAT 

TTCACAAATTAAAAATATTTGGTGTAAAGATTGTTATTTTTATACATGATGT 

TGTACCGCTAATGTTTGATGGAAATTTTTATTTGATGGATAGAACTATAGC 

TTATTATAATGAAGCAGATGTTTAATAGCCCCTAGTCAAGCAATGGTCGAT 

AAGCTT 

MRTYITNLNGHSITSTAQIAQNMVTDIAVSLGFRELGIHSYPIDTDSPEEMSKRL 
DGICSGLRKNDrVIFQTPTWNTTTFDEKLFHKLKIFGVKIVIFIHDVVPLMFDGN 
FYLMDRTIAYYNEADVLIAPSQAMVDKL 



Sequence description: 

A] Length: 414 bp - 138 aa(partial gene) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-90 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-90 gene sequence. 
No obvious signal peptide, but a 

possible Shine Dalgamo sequence is present 
upstream of ATG start codon. C-terminus has yet 
to be determined. 



ID- 197 



Clone RS-59c (ID-90c) 
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CATGGAAATGAAGTTGATGATGTTATTAGAAGGGCATTTGAATATAATCAC 

CTTATCTTTGCTTTTGATAATACCTGTCATAACAGAGAGTTAGTATTAGATA 

GCAATATCATTTCTCACACAACCTGTGAACAATTGATAAATTTAATGAAAA 

ATTTATCAGGCTCCATTATGTATTTGCTAGAGCAACAAAGAGAACAAACA 

AGTAATGAAACAAAAGAGCGTTATAAAGAAATATTAGGAGGGTATGGAA 

ATGCCTAA 

HGNEVDDVIRRAFEYNHLIFAFDNTCHNRELVLDSNIISHTTCEQLINLMKNLS 
GSIMYLLEQQREQTSNETKERYKEILGGYGNA* 



Sequence description: 

A] Length: 261 bp - 87 aa(partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID-90 gene which was identified by LEEP, during 
cloning and sequence analysis of the full-length ID-90 gene sequence. N- 
terminus has yet to be determined 



ID-198 



Clone RS-70b (ID-93b) 

ACAi 1 1 1 ' 1 ATAT TATGTATTTG AAGACGTAGCCACCC AGTC AAATATGACT 

GGG AAGATT TTTAGTATGTCTAAAGAAGAGTTGTCATATTTACCCGTTATT 

AAACTTTTTAAGAATCAAGGTGTATACAACGGCTTGATTGGTCTATTCCTC 

CTTTATGGGTTATATATTTCACAGAATCAAGAAATTGTAGCTATTTTTTTAA 

TCAATGTGTTGCTAGTTGCTGTTTATGGTGCTTTGACAGTTGATAAAAAAA 

TCTTATTAAAACAGGGTGGTTTACCTATATTAGCTCTTTTAACATTCTTATT 

TTAA 

TFLYYVFEDVATQSNMTGKIFSMSKEELSYLPVIKLFKNQGVYNGLIGLFLLY 
GLYISQNQErVAIFLINVLLVAVYGALTVDKKILLKQGGLPILALLTFLF* 



Sequence description: 



A] Length: 312 bp - 104 aa (partial gene sequence) 
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B] This gene sequence was not identified using the LEEP system. It 
was identified upstream of the ID- 93 gene which was identified by 
LEEP, during cloning and sequence analysis of the full-length ID-93 
gene sequence. 

N-terminus has yet to be determined 



ID- 199 

Clone RS-70c (ID-93c) 

ATGAAATTAAGTGTCCTTGATTATGGGCTTATTGATTATGGAAAAACTGCA 

AGTGATGCAATACAAGAAACGATTCTTTTATCACAAGAGGCGGAGCAACT 

AGGCTATCATCAATTTTGGGTGGCTGAACATCACGGTGTTAAGGCATTCAG 

TATTAGCAATCCAGAATTAATGATAATGCATTTGGCTAACCAGACTAAATC 

TATCAAAATTGGCTCTGGAGGTATAATGCCTCTGCACTATAGTAGTTTTAA 

ACTCGCGGAGACTCTCAAGACATTAGAGACATGTCATCCGAATCGAGTAA 

GTATTGGTTTAGGAAATTCACTAGGGACAGTTAAAGTTTCAAATGCACTTC 

GTAGCTTACATAAAGCACATGATTACGAAGAGGTACTGGAGGAATTGAAG 

TCATGGCTTATTGATGAATCATCCAGTAAGGAACCATTAGTTCAACCGACT 

CTTTCTAGCTTCCCAGACTTATATGTGTTGGGGAGTGGTCAAAAATCAGCT 

TATTTAGCGGCTAAACTTGGCTTAGGCTTTACCTTCGGTGTTTTTCCTTTTA 

TGGACAAAGACCCATTGACAGAAGCTAAA 

MKLSVLDYGLIDYGKTASDAIQETILLSQEAEQLGYHQFWVAEHHGVKAFSIS 
NPELMIMHLANQTKSIKIGSGG1MPLHYSSFKLAETLKTLETCHPNRVSIGLGN 
SLGTVKVSNALRSLHKAHDYEEVLEELKSWLIDESSSKEPLVQPTLSSFPDLYV 
LGSGQKSAYLAAKLGLGFTFGVFPFMDKDPLTEAK 



Sequence description: 

A] Length: 588 bp - 1 96 aa (partial) 

B] This gene sequence was not identified using the LEEP system. It was identified 
downstream of the ID-93 gene which was identified by LEEP, during cloning and 
sequence analysis of the full-length ID-93 gene sequence. No obvious signal peptide, 
but Shine Dalgamo sequence upstream of the ATG start codon. 
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nucSl 

Bgl II Eco RV 
5 ' -cq aqatctqatatc tcaeaaaeaqataacqqcqtaaataq -3 ' 

nucS2 

Bgl II Sma I 

5 * -qa aqatctteceeqqqa tcacaaacaqataacqqcqtaaataq -3 ' 

nucS3 

Bgl II Eco RV 
5 ' -eq aqatctqatate eateaeaaaeaqataacqqcqtaaataq -3 ' 

nucR 

Bam HI 

5 ' -cq qqatcc ttatqqaectqaatcaqcqttqtc -3' 
NucSeq 

5 ' -ggatgctfctgtttcaggtgtatc -3* 
pTREPp 

5 • -catgatatcggtacctcaagctcatatcattgtccggcaatggtgtgggctttttttgttttagcggata 
caatttcacac -3 • 



pTREP R 

5 ' -gcggatcccccgggcttaattaatgtttaaacactagtcgaagatctcgcgaattctcctgtgtgaaatt 
gttatccgcta -3' 



puc F 

5 ' -cgccagggttttcccagtcacgac -3 ' 

FIG. 3 
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V 2 
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pTREP-Nuc vectors allow cloning of genomic DNA into each 
frame with respect to the nuclease gene 



pTREPl-nucl (EcoRV) AAGTATCAGAT CT - - GATATC - - TCACAAACAKATAACGGCGTAAAT Frame=+1 
P TREPl-nuc2 (Sma 1) AAGTATCACATCTTC CCCGGCA - TCACAAACAGAT AAC GGC GTAAAT Frame— t- 2 

::: :::::::::: A I:::::::::::::::::;;;;;; 

pTREPl-nuc3 {EcoRV) AAGTATCAGATCT — GATATTCCATCACAAACAGATAACGGCGT AAAT Fraao=+3 



Nuclease Gone 

Cloning site is indicated bt an arrow 



TCACAAACAGAT AACGGCGT AAAT 
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FIG. 5 

SDS-PAGE analysis of the purified ID-65 and ID-83 protein antigens 
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FIG. 6 



SDS-PAGE analysis of the purified ID-93 antigen 
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FIG. 7 

SDS-PAGE analysis of the purified ID-89 and ID-96 protein antigens 
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FIG. 8 

IgG Titres against the ID-65 and ID-83 proteins 
ID-65 and ID-83 Vaccinations -IgG Titres 
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Survival data 
ID-93 Vaccination- GBS Challenge and Survival 
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FIG. 10 

IgG Titres against the ID-93 protein 
ID-93 Protein Vaccine -IgG Titres 
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FIG. 11 



IgG Titres against the ID-89 and ID-96 proteins 
ID-89 and ID-96 Protein Vaccines -IgG Titres 
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FIG. 12 

Southern blot analysis - rib 
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FIG. 13 

Southern btot analysis - ID-65 
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FIG. 14 

Southern blot analysis - ID-89 
4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 




4 kb 



